방법 비교
선택한 방법을 나란히 검토하세요. 서로 다른 행은 강조 표시됩니다.
| 준지도 학습 그래디언트 부스팅× | 준지도학습 랜덤 포레스트× | |
|---|---|---|
| 분야 | 머신러닝 | 머신러닝 |
| 계열 | Machine learning | Machine learning |
| 기원 연도≠ | 2006–2010s | 2009 |
| 창시자≠ | Chapelle, Scholkopf & Zien (eds.); applied to GBM variants in subsequent literature | Leistner, C., Saffari, A., Santner, J., & Bischof, H. |
| 유형≠ | Semi-supervised ensemble (self-training + gradient boosted trees) | Semi-supervised ensemble classifier |
| 원전≠ | Yarowsky, D. (1995). Unsupervised word sense disambiguation rivaling supervised methods. Proceedings of ACL 1995, 189–196. (Foundational self-training framework underlying pseudo-label approaches.) link ↗ | Leistner, C., Saffari, A., Santner, J., & Bischof, H. (2009). Semi-supervised random forests. In Proceedings of the IEEE 12th International Conference on Computer Vision (ICCV), pp. 506–513. IEEE. DOI ↗ |
| 별칭 | pseudo-label gradient boosting, self-training GBM, semi-supervised GBT, label-propagation boosting | SSL-RF, semi-supervised forest, label-propagation random forest, self-training random forest |
| 관련≠ | 6 | 3 |
| 요약≠ | Semi-supervised gradient boosting combines gradient boosted trees with self-training or pseudo-labeling to exploit large pools of unlabeled data alongside a small labeled set. An initial GBM fit on labeled data assigns confident predictions to unlabeled examples; those pseudo-labeled points are folded back into training and the model is re-boosted, iterating until convergence. This allows practitioners to harness cheap unlabeled data when labels are scarce or expensive. | Semi-supervised Random Forest (SSL-RF) extends the classic Random Forest by exploiting both labeled and unlabeled training examples. When labeling data is expensive or time-consuming, SSL-RF assigns tentative pseudo-labels to unlabeled observations through the forest itself, then retrains on the enriched dataset, progressively improving accuracy without requiring additional human annotation. |
| ScholarGate데이터셋 ↗ |
|
|