Machine learningDeep learning / NLP / CV
Semi-supervised GRU
Semi-supervised GRU applies the Gated Recurrent Unit architecture to settings where only a small fraction of sequential data is labeled. By first pre-training or jointly training on abundant unlabeled sequences — through language modeling, auto-encoding, or consistency regularization — and then fine-tuning on labeled examples, the model exploits the full corpus to learn richer sequence representations than supervised-only training would allow.
Open in MethodMindSoonVideoSoon
Read the full method
Members only
Sign inSign in with a free account to read this section.
Sources
- Dai, A. M., & Le, Q. V. (2015). Semi-supervised Sequence Learning. Advances in Neural Information Processing Systems (NeurIPS), 28. link ↗
- Cho, K., van Merrienboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., & Bengio, Y. (2014). Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. EMNLP 2014. link ↗