ScholarGate

Compare methods

View your selected methods side by side; rows that differ are highlighted.

Field | Synthetic data generation for privacy management | Multiple imputation
Discipline | Privacy protection | Statistics
Method family | Machine learning | Process / pipeline
Year of origin | 1993 | 1987
Developer | Donald Rubin | Donald B. Rubin
Type | Privacy-preserving data synthesis | Missing-data handling procedure
Original source | Rubin, D. B. (1993). Statistical disclosure limitation. Journal of Official Statistics, 9(2), 461–468. | Rubin, D. B. (1987). Multiple Imputation for Nonresponse in Surveys. Wiley.
Alternative names | Fully Synthetic Data, Partial Synthetic Data, Statistical Data Synthesis, Sentetik Veri Üretimi | MICE, Multivariate Imputation by Chained Equations, Çoklu Atama (Multiple Imputation, MICE)
Related | 3 | 1

Summary (Synthetic data generation): Synthetic data generation is a statistical disclosure limitation technique introduced by Donald Rubin in 1993. Instead of releasing the values in a confidential dataset directly, the data holder replaces them with draws from a fitted posterior predictive distribution. The resulting artificial records are intended to preserve the joint statistical structure of the original data while reducing the risk that real individuals can be identified, so analysts can work with a publicly releasable dataset that behaves like the original for most inferential purposes.

Summary (Multiple imputation): Multiple Imputation (MI), formally introduced by Donald B. Rubin in 1987, is a principled statistical procedure for handling missing data. Rather than replacing each missing value once, MI fills the gaps m times, each time drawing plausible values from the posterior predictive distribution of the missing data, which produces m complete datasets. Each dataset is analysed independently, and the m results are combined into a single set of estimates using Rubin's pooling rules. The MICE variant (Multivariate Imputation by Chained Equations), popularised by van Buuren and Groothuis-Oudshoorn (2011), extends the approach to mixed variable types by imputing each variable in turn through a sequence of conditional regression models.
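The pooling step mentioned in the multiple imputation summary can be illustrated with a short Python sketch. It assumes each of the m completed datasets has already been analysed and has yielded one point estimate and one variance (squared standard error). The function name `pool_rubin`, the `PooledEstimate` container, and the example numbers are hypothetical and chosen only for illustration; they are not taken from ScholarGate or from any particular library.

```python
from statistics import mean, variance
from typing import NamedTuple, Sequence


class PooledEstimate(NamedTuple):
    estimate: float     # pooled point estimate: mean of the m estimates
    within_var: float   # average of the m within-imputation variances
    between_var: float  # sample variance of the m estimates
    total_var: float    # within_var + (1 + 1/m) * between_var


def pool_rubin(estimates: Sequence[float],
               variances: Sequence[float]) -> PooledEstimate:
    """Combine m per-dataset results with Rubin's pooling rules.

    estimates: one point estimate per imputed dataset
    variances: the matching squared standard errors
    """
    m = len(estimates)
    if m < 2 or len(variances) != m:
        raise ValueError("need at least two imputations and matching lengths")

    q_bar = mean(estimates)   # pooled estimate
    u_bar = mean(variances)   # within-imputation variance
    b = variance(estimates)   # between-imputation variance (divisor m - 1)
    t = u_bar + (1 + 1 / m) * b  # total variance of the pooled estimate
    return PooledEstimate(q_bar, u_bar, b, t)


# Illustrative (made-up) results from m = 3 imputed datasets.
pooled = pool_rubin([10.0, 12.0, 11.0], [1.0, 1.0, 1.0])
print(pooled.estimate, pooled.total_var)
```

The extra term (1 + 1/m) * between_var is what separates pooled inference from simply averaging the m analyses: it adds the uncertainty caused by the missing values to the ordinary sampling variance.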
ScholarGate material
  1. v1, 1 source, PUBLISHED
  2. v1, 2 sources, PUBLISHED


ScholarGate. Compare methods: Synthetic Data Generation · Multiple Imputation. Retrieved 2026-06-17 from https://scholargate.app/fi/compare