Linganisha mbinu
Pitia mbinu ulizochagua bega kwa bega; safu zinazotofautiana zinaangaziwa.
| Transformer ya Usimamizi dhaifu× | Transformer yenye usimamizi-nusu× | |
|---|---|---|
| Nyanja | Ujifunzaji wa Kina | Ujifunzaji wa Kina |
| Familia | Machine learning | Machine learning |
| Mwaka wa asili≠ | 2017–2019 | 2018–2019 |
| Mwanzilishi≠ | Multiple contributors (weak supervision paradigm: Zhou 2018; transformer backbone: Vaswani et al. 2017) | Devlin, J. et al. (BERT); broader SSL-Transformer paradigm community |
| Aina≠ | Weakly supervised deep learning | Semi-supervised deep learning |
| Chanzo asilia≠ | Ratner, A., Bach, S. H., Ehrenberg, H., Fries, J., Wu, S., & Re, C. (2017). Snorkel: Rapid training data creation with weak supervision. Proceedings of the VLDB Endowment, 11(3), 269–282. DOI ↗ | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of NAACL-HLT 2019, 4171–4186. DOI ↗ |
| Majina mbadala | WST, weakly supervised attention model, noisy-label transformer, weak supervision with transformers | semi-supervised transformer model, SSL transformer, transformer with self-supervised pre-training, semi-supervised attention model |
| Zinazohusiana | 5 | 5 |
| Muhtasari≠ | Weakly Supervised Transformer combines the representational power of Transformer architectures with weak supervision strategies that exploit noisy, incomplete, or programmatically generated labels — making it possible to train high-quality NLP and vision models when fully annotated datasets are scarce or prohibitively expensive to produce. | Semi-supervised learning with Transformer architectures leverages large quantities of unlabeled data alongside a small labeled set to train powerful sequence models. The dominant pattern — exemplified by BERT — first pre-trains the Transformer on unlabeled data using self-supervised objectives such as masked token prediction, then fine-tunes it on the labeled task. This two-stage approach dramatically reduces the labeled data needed to achieve strong performance. |
| ScholarGateSeti ya data ↗ |
|
|