So sánh phương pháp
Xem các phương pháp đã chọn cạnh nhau; những hàng khác biệt được làm nổi bật.
| Đánh giá rủi ro tiết lộ× | Quyền riêng tư vi phân× | Tạo dữ liệu tổng hợp để kiểm soát tiết lộ× | |
|---|---|---|---|
| Lĩnh vực | Quyền riêng tư | Quyền riêng tư | Quyền riêng tư |
| Họ≠ | Regression model | Machine learning | Machine learning |
| Năm ra đời≠ | 1989 | 2006 | 1993 |
| Người khởi xướng≠ | George Duncan & Diane Lambert | Cynthia Dwork | Donald Rubin |
| Loại≠ | Probabilistic risk model | Privacy-preserving randomized mechanism | Privacy-preserving data synthesis |
| Công trình gốc≠ | Duncan, G. T., & Lambert, D. (1989). The risk of disclosure for microdata. Journal of Business & Economic Statistics, 7(2), 207–217. DOI ↗ | Dwork, C. (2006). Differential privacy. International Colloquium on Automata, Languages and Programming (ICALP), 1–12. DOI ↗ | Rubin, D. B. (1993). Statistical disclosure limitation. Journal of Official Statistics, 9(2), 461–468. link ↗ |
| Tên gọi khác | Microdata Disclosure Risk, Statistical Disclosure Control Risk Estimation, Istatistiksel Açıklama Riski Değerlendirmesi, Re-identification Risk Assessment | DP, epsilon-differential privacy, randomized privacy, Diferansiyel Gizlilik | Fully Synthetic Data, Partial Synthetic Data, Statistical Data Synthesis, Sentetik Veri Üretimi |
| Liên quan | 3 | 3 | 3 |
| Tóm tắt≠ | Disclosure Risk Assessment is a probabilistic framework introduced by Duncan and Lambert (1989) for quantifying how likely it is that releasing microdata — individual-level records from surveys or administrative files — will allow an outside party to identify a specific respondent or infer sensitive attributes. It is used by statistical agencies, data custodians, and researchers charged with protecting confidentiality before any public release of person-level datasets. | Differential privacy is a mathematical framework for releasing statistical information about a dataset while providing rigorous guarantees that individual records cannot be identified or inferred. Introduced by Cynthia Dwork in 2006, it formalizes privacy as a probabilistic bound: any single individual's presence or absence in the dataset changes the output distribution by at most a multiplicative factor of e^ε, where ε is the privacy budget controlling the privacy–utility tradeoff. | Synthetic data generation is a statistical disclosure limitation technique introduced by Donald Rubin in 1993, in which values in a confidential dataset are replaced by draws from a fitted posterior predictive distribution rather than released directly. The resulting artificial records preserve the joint statistical structure of the original data while preventing the identification of real individuals, enabling analysts to work with a publicly releasable dataset that behaves like the original for most inferential purposes. |
| ScholarGateBộ dữ liệu ↗ |
|
|
|