Порівняння методів
Переглядайте обрані методи поруч; рядки з відмінностями підсвічено.
| Регуляризований CatBoost× | CatBoost× | Регуляризований градієнтний бустинг× | |
|---|---|---|---|
| Галузь | Машинне навчання | Машинне навчання | Машинне навчання |
| Родина | Machine learning | Machine learning | Machine learning |
| Рік появи≠ | 2018 | 2018 | 2001 (gradient boosting); 2016 (explicit L1/L2 regularization in XGBoost) |
| Автор методу≠ | Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A. V., & Gulin, A. (Yandex Research) | Prokhorenkova, L. et al. (Yandex) | Chen, T. & Guestrin, C. (building on Friedman, J. H.) |
| Тип≠ | Regularized gradient boosting ensemble | Gradient boosting on decision trees | Regularized ensemble (additive tree model) |
| Основоположне джерело≠ | Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A. V., & Gulin, A. (2018). CatBoost: unbiased boosting with categorical features. Advances in Neural Information Processing Systems, 31. link ↗ | Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A.V. & Gulin, A. (2018). CatBoost: Unbiased Boosting with Categorical Features. In NeurIPS 2018. DOI ↗ | Chen, T. & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794. DOI ↗ |
| Інші назви | CatBoost with regularization, regularized categorical boosting, CatBoost L2 regularization, penalized CatBoost | CatBoost (Categorical Boosting), categorical boosting, ordered boosting, kategorik gradyan artırma | penalized gradient boosting, shrinkage-regularized boosting, XGBoost-style regularization, L1/L2 gradient boosting |
| Пов'язані≠ | 5 | 5 | 6 |
| Підсумок≠ | Regularized CatBoost applies explicit regularization controls — L2 leaf regularization, tree depth constraints, shrinkage rate, and model size penalties — on top of CatBoost's ordered gradient boosting framework, reducing overfitting while retaining CatBoost's native handling of categorical features and its low prediction latency on tabular datasets. | CatBoost is a gradient boosting algorithm, introduced by Prokhorenkova and colleagues at Yandex in 2018, that handles categorical variables natively and uses ordered target encoding to avoid label leakage. By building an additive ensemble of trees while permuting the data order at each iteration, it is often superior to XGBoost and LightGBM on category-heavy data. | Regularized gradient boosting extends the classic additive tree ensemble (Friedman 2001) by embedding L1 and L2 penalty terms directly into the training objective, along with a complexity penalty on tree size. Popularized by XGBoost (Chen & Guestrin 2016), this framework reduces overfitting and improves generalization compared to unpenalized boosting, while retaining the method's characteristic accuracy on tabular data. |
| ScholarGateНабір даних ↗ |
|
|
|