השוואת שיטות
סקרו את השיטות שבחרתם זו לצד זו; שורות שבהן יש הבדל מודגשות.
| k-אנונימיות: הגנה על פרטיות הפרט בנתונים ששוחררו× | פרטיות דיפרנציאלית× | יצירת נתונים סינתטיים לבקרת חשיפה× | |
|---|---|---|---|
| תחום | פרטיות | פרטיות | פרטיות |
| משפחה | Machine learning | Machine learning | Machine learning |
| שנת המקור≠ | 2002 | 2006 | 1993 |
| הוגה השיטה≠ | Latanya Sweeney | Cynthia Dwork | Donald Rubin |
| סוג≠ | Privacy-preserving data transformation | Privacy-preserving randomized mechanism | Privacy-preserving data synthesis |
| מקור מכונן≠ | Sweeney, L. (2002). k-anonymity: A model for protecting privacy. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 10(5), 557–570. DOI ↗ | Dwork, C. (2006). Differential privacy. International Colloquium on Automata, Languages and Programming (ICALP), 1–12. DOI ↗ | Rubin, D. B. (1993). Statistical disclosure limitation. Journal of Official Statistics, 9(2), 461–468. link ↗ |
| כינויים | k-Anonymization, k-Anonymous Microdata, Quasi-Identifier Suppression Model, k-Anonimlik | DP, epsilon-differential privacy, randomized privacy, Diferansiyel Gizlilik | Fully Synthetic Data, Partial Synthetic Data, Statistical Data Synthesis, Sentetik Veri Üretimi |
| קשורות≠ | 2 | 3 | 3 |
| תקציר≠ | k-Anonymity is a formal privacy model introduced by Latanya Sweeney in 2002 to protect individuals when personal data is released for research or public use. It requires that every record in a published dataset be indistinguishable from at least k−1 other records with respect to a designated set of quasi-identifying attributes — such as age, gender, and ZIP code — preventing re-identification by linking released data to external sources. | Differential privacy is a mathematical framework for releasing statistical information about a dataset while providing rigorous guarantees that individual records cannot be identified or inferred. Introduced by Cynthia Dwork in 2006, it formalizes privacy as a probabilistic bound: any single individual's presence or absence in the dataset changes the output distribution by at most a multiplicative factor of e^ε, where ε is the privacy budget controlling the privacy–utility tradeoff. | Synthetic data generation is a statistical disclosure limitation technique introduced by Donald Rubin in 1993, in which values in a confidential dataset are replaced by draws from a fitted posterior predictive distribution rather than released directly. The resulting artificial records preserve the joint statistical structure of the original data while preventing the identification of real individuals, enabling analysts to work with a publicly releasable dataset that behaves like the original for most inferential purposes. |
| ScholarGateמערך נתונים ↗ |
|
|
|