Machine learningPrivacy-preserving analysis

用于披露控制的合成数据生成

合成数据生成是一种统计披露限制技术，由Donald Rubin于1993年提出。在该技术中，机密数据集中的值被来自已拟合的后验预测分布的样本替换，而不是直接发布。由此产生的合成记录保留了原始数据的联合统计结构，同时防止了真实个体的识别，使分析人员能够使用一个可公开发布的、在大多数推断目的上表现与原始数据相似的数据集。

在 MethodMind 中打开即将推出视频即将推出Download slides

阅读完整方法

仅限会员

使用免费账户登录即可阅读本节。

Method map

The neighbourhood of related methods — select a node to explore.

用于披露控制的合成数据生成

差分隐私生成对抗网络 Multiple Imputation 披露风险评估 k-匿名化：保护发布数据中的个体隐私

来源

Rubin, D. B. (1993). Statistical disclosure limitation. Journal of Official Statistics, 9(2), 461–468. link ↗

如何引用本页

ScholarGate. (2026, June 2). Synthetic Data Generation for Disclosure Control. ScholarGate. https://scholargate.app/zh/privacy/synthetic-data-generation

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side →

被引用于

差分隐私披露风险评估 k-匿名化：保护发布数据中的个体隐私

发现本页有问题？报告或提出修改建议 →

阅读完整方法

Method map

来源

如何引用本页

相关方法

Which method?

被引用于