Comparer des méthodes
Examinez les méthodes sélectionnées côte à côte ; les lignes qui diffèrent sont mises en évidence.
| Empilement bayésien (Bayesian stacking)× | Moyenne Bayésienne de Modèles× | Boosting× | |
|---|---|---|---|
| Domaine≠ | Apprentissage automatique | Bayésien | Apprentissage automatique |
| Famille≠ | Machine learning | Bayesian methods | Machine learning |
| Année d'origine≠ | 2018 | 1999 | 1990–1997 |
| Auteur d'origine≠ | Yao, Y.; Vehtari, A.; Simpson, D.; Gelman, A. | Hoeting, Madigan, Raftery & Volinsky | Schapire, R. E.; Freund, Y. |
| Type≠ | Bayesian ensemble combination | Bayesian model averaging | Sequential ensemble (iterative reweighting) |
| Source fondatrice≠ | Yao, Y., Vehtari, A., Simpson, D., & Gelman, A. (2018). Using stacking to average Bayesian predictive distributions. Bayesian Analysis, 13(3), 917–1007. DOI ↗ | Hoeting, J. A., Madigan, D., Raftery, A. E. & Volinsky, C. T. (1999). Bayesian Model Averaging: A Tutorial. Statistical Science, 14(4), 382–401. link ↗ | Freund, Y. & Schapire, R. E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1), 119–139. DOI ↗ |
| Alias≠ | Bayesian stacking, Bayesian model stacking, stacking with Bayesian weights, predictive distribution stacking | BMA, Bayesian model combination, Bayesian Model Ortalaması (BMA) | AdaBoost, gradient boosting, iterative reweighting ensemble, sequential ensemble |
| Apparentées≠ | 6 | 5 | 6 |
| Résumé≠ | Bayesian stacking combines the predictive distributions of several base models by finding non-negative weights that maximise the leave-one-out log predictive score of the mixture. Formalised by Yao, Vehtari, Simpson, and Gelman (2018), it yields a single calibrated predictive distribution that is provably at least as good as any single constituent model under cross-validation. | Bayesian Model Averaging (BMA), formalised as a tutorial by Hoeting, Madigan, Raftery and Volinsky in 1999, addresses model uncertainty by averaging over all plausible model specifications rather than selecting a single best model. Each candidate model receives a posterior probability that reflects how well it fits the data given a prior, and predictions or coefficient estimates are formed as weighted averages across the entire model space. This approach reduces the bias and overconfidence that arise when a single selected model is treated as the true one. | Boosting is a sequential ensemble technique that converts many simple, barely-better-than-chance learners into a single highly accurate model by repeatedly focusing training on the examples that previous learners got wrong, then combining all learners with weights proportional to their individual accuracy. |
| ScholarGateJeu de données ↗ |
|
|
|