Comparer des méthodes
Examinez les méthodes sélectionnées côte à côte ; les lignes qui diffèrent sont mises en évidence.
| Évaluation du risque de divulgation× | Génération de données synthétiques pour le contrôle de la divulgation× | |
|---|---|---|
| Domaine | Protection de la vie privée | Protection de la vie privée |
| Famille≠ | Regression model | Machine learning |
| Année d'origine≠ | 1989 | 1993 |
| Auteur d'origine≠ | George Duncan & Diane Lambert | Donald Rubin |
| Type≠ | Probabilistic risk model | Privacy-preserving data synthesis |
| Source fondatrice≠ | Duncan, G. T., & Lambert, D. (1989). The risk of disclosure for microdata. Journal of Business & Economic Statistics, 7(2), 207–217. DOI ↗ | Rubin, D. B. (1993). Statistical disclosure limitation. Journal of Official Statistics, 9(2), 461–468. link ↗ |
| Alias | Microdata Disclosure Risk, Statistical Disclosure Control Risk Estimation, Istatistiksel Açıklama Riski Değerlendirmesi, Re-identification Risk Assessment | Fully Synthetic Data, Partial Synthetic Data, Statistical Data Synthesis, Sentetik Veri Üretimi |
| Apparentées | 3 | 3 |
| Résumé≠ | Disclosure Risk Assessment is a probabilistic framework introduced by Duncan and Lambert (1989) for quantifying how likely it is that releasing microdata — individual-level records from surveys or administrative files — will allow an outside party to identify a specific respondent or infer sensitive attributes. It is used by statistical agencies, data custodians, and researchers charged with protecting confidentiality before any public release of person-level datasets. | Synthetic data generation is a statistical disclosure limitation technique introduced by Donald Rubin in 1993, in which values in a confidential dataset are replaced by draws from a fitted posterior predictive distribution rather than released directly. The resulting artificial records preserve the joint statistical structure of the original data while preventing the identification of real individuals, enabling analysts to work with a publicly releasable dataset that behaves like the original for most inferential purposes. |
| ScholarGateJeu de données ↗ |
|
|