방법 비교
선택한 방법을 나란히 검토하세요. 서로 다른 행은 강조 표시됩니다.
| k-익명성: 공개 데이터에서 개인 정보 보호× | Synthetic Data Generation× | |
|---|---|---|
| 분야 | 프라이버시 | 프라이버시 |
| 계열 | Machine learning | Machine learning |
| 기원 연도≠ | 2002 | 1993 |
| 창시자≠ | Latanya Sweeney | Donald Rubin |
| 유형≠ | Privacy-preserving data transformation | Privacy-preserving data synthesis |
| 원전≠ | Sweeney, L. (2002). k-anonymity: A model for protecting privacy. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 10(5), 557–570. DOI ↗ | Rubin, D. B. (1993). Statistical disclosure limitation. Journal of Official Statistics, 9(2), 461–468. link ↗ |
| 별칭 | k-Anonymization, k-Anonymous Microdata, Quasi-Identifier Suppression Model, k-Anonimlik | Fully Synthetic Data, Partial Synthetic Data, Statistical Data Synthesis, Sentetik Veri Üretimi |
| 관련≠ | 2 | 3 |
| 요약≠ | k-Anonymity is a formal privacy model introduced by Latanya Sweeney in 2002 to protect individuals when personal data is released for research or public use. It requires that every record in a published dataset be indistinguishable from at least k−1 other records with respect to a designated set of quasi-identifying attributes — such as age, gender, and ZIP code — preventing re-identification by linking released data to external sources. | Synthetic data generation is a statistical disclosure limitation technique introduced by Donald Rubin in 1993, in which values in a confidential dataset are replaced by draws from a fitted posterior predictive distribution rather than released directly. The resulting artificial records preserve the joint statistical structure of the original data while preventing the identification of real individuals, enabling analysts to work with a publicly releasable dataset that behaves like the original for most inferential purposes. |
| ScholarGate데이터셋 ↗ |
|
|