Порівняння методів
Переглядайте обрані методи поруч; рядки з відмінностями підсвічено.
| Механізми пропущених даних: MCAR, MAR та MNAR× | MICE× | Множинне імпутування× | |
|---|---|---|---|
| Галузь | Статистика | Статистика | Статистика |
| Родина | Process / pipeline | Process / pipeline | Process / pipeline |
| Рік появи≠ | 1976 | 2011 | 1987 |
| Автор методу≠ | Donald Rubin | Stef van Buuren & Karin Groothuis-Oudshoorn | Donald B. Rubin |
| Тип≠ | Diagnostic / classification framework | Iterative multiple imputation algorithm | Missing-data handling procedure |
| Основоположне джерело≠ | Rubin, D. B. (1976). Inference and missing data. Biometrika, 63(3), 581–592. DOI ↗ | van Buuren, S., & Groothuis-Oudshoorn, K. (2011). mice: Multivariate imputation by chained equations in R. Journal of Statistical Software, 45(3), 1–67. DOI ↗ | Rubin, D.B. (1987). Multiple Imputation for Nonresponse in Surveys. Wiley. DOI ↗ |
| Інші назви≠ | Missing Data Typology, Rubin's Missing Data Framework, Missingness Mechanisms, Kayıp Veri Mekanizmaları | Fully Conditional Specification, Sequential Regression Multivariate Imputation, Chained Equations Imputation, Zincirleme Denklemlerle Çoklu Atama | MICE, Multivariate Imputation by Chained Equations, Çoklu Atama (Multiple Imputation — MICE) |
| Пов'язані≠ | 3 | 3 | 1 |
| Підсумок≠ | Missing data mechanisms, introduced by Donald Rubin in 1976, provide a formal taxonomy for classifying why observations are absent from a dataset. The three categories — Missing Completely At Random (MCAR), Missing At Random (MAR), and Missing Not At Random (MNAR) — describe the relationship between the probability of missingness and the observed or unobserved values. Identifying the correct mechanism is essential because it determines which analytical strategies preserve valid and unbiased inference. | Multivariate Imputation by Chained Equations (MICE) is an iterative procedure for handling missing data in multivariate datasets. Introduced by Stef van Buuren and Karin Groothuis-Oudshoorn through the R package mice (2011), the algorithm fills each missing variable using a separate regression model conditioned on all other variables, cycling through variables repeatedly until the imputed values converge. The result is m completed datasets that are analysed separately and combined using Rubin's rules. | Multiple Imputation (MI), formally introduced by Donald B. Rubin in 1987, is a principled statistical procedure for handling missing data. Rather than replacing each missing value once, MI fills the gaps m times — each time drawing plausible values from the posterior predictive distribution of the missing data — producing m complete datasets. Each dataset is analysed independently, and the results are combined into a single set of estimates using Rubin's pooling rules. The MICE variant (Multivariate Imputation by Chained Equations), popularised by van Buuren and Groothuis-Oudshoorn (2011), extends the approach to mixed variable types by imputing each variable in turn through a sequence of conditional regression models. |
| ScholarGateНабір даних ↗ |
|
|
|