Сравнение методов
Просматривайте выбранные методы рядом; строки с различиями подсвечены.
| CatBoost× | Градиентный бустинг× | Регуляризованный градиентный бустинг× | |
|---|---|---|---|
| Область | Машинное обучение | Машинное обучение | Машинное обучение |
| Семейство | Machine learning | Machine learning | Machine learning |
| Год появления≠ | 2018 | 2001 | 2001 (gradient boosting); 2016 (explicit L1/L2 regularization in XGBoost) |
| Автор метода≠ | Prokhorenkova, L. et al. (Yandex) | Friedman, J. H. | Chen, T. & Guestrin, C. (building on Friedman, J. H.) |
| Тип≠ | Gradient boosting on decision trees | Ensemble (sequential boosting of decision trees) | Regularized ensemble (additive tree model) |
| Основополагающий источник≠ | Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A.V. & Gulin, A. (2018). CatBoost: Unbiased Boosting with Categorical Features. In NeurIPS 2018. DOI ↗ | Friedman, J. H. (2001). Greedy Function Approximation: A Gradient Boosting Machine. Annals of Statistics, 29(5), 1189–1232. DOI ↗ | Chen, T. & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794. DOI ↗ |
| Другие названия | CatBoost (Categorical Boosting), categorical boosting, ordered boosting, kategorik gradyan artırma | Gradient Boosting (GBM), GBM, gradient boosted trees, gradient boosting machine | penalized gradient boosting, shrinkage-regularized boosting, XGBoost-style regularization, L1/L2 gradient boosting |
| Связанные≠ | 5 | 5 | 6 |
| Сводка≠ | CatBoost is a gradient boosting algorithm, introduced by Prokhorenkova and colleagues at Yandex in 2018, that handles categorical variables natively and uses ordered target encoding to avoid label leakage. By building an additive ensemble of trees while permuting the data order at each iteration, it is often superior to XGBoost and LightGBM on category-heavy data. | Gradient Boosting is an ensemble learning method, formalised by Jerome H. Friedman in 2001, that combines a sequence of weak learners — typically shallow decision trees — so that each new tree is fitted to minimise the residual errors of the trees before it. It is the core algorithm behind popular implementations such as XGBoost, LightGBM and CatBoost. | Regularized gradient boosting extends the classic additive tree ensemble (Friedman 2001) by embedding L1 and L2 penalty terms directly into the training objective, along with a complexity penalty on tree size. Popularized by XGBoost (Chen & Guestrin 2016), this framework reduces overfitting and improves generalization compared to unpenalized boosting, while retaining the method's characteristic accuracy on tabular data. |
| ScholarGateНабор данных ↗ |
|
|
|