Compare methods
Review your selected methods side by side; rows that differ are highlighted.
| | Self-Supervised GRU | Long Short-Term Memory (LSTM) |
|---|---|---|
| Field | Deep Learning | Deep Learning |
| Family | Machine learning | Machine learning |
| Year of origin ≠ | 2014–2019 | 1997 |
| Originators ≠ | Cho, K. et al. (GRU); self-supervised training paradigm from the broader SSL literature | Hochreiter, S. & Schmidhuber, J. |
| Type ≠ | Self-supervised sequence model | Recurrent neural network with gated memory cells |
| Seminal source ≠ | Cho, K., van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., & Bengio, Y. (2014). Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. In Proceedings of EMNLP 2014. | Hochreiter, S. & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735–1780. |
| Aliases | SS-GRU, Self-supervised Gated Recurrent Unit, GRU with self-supervised pretraining, Unsupervised GRU pretraining | LSTM, LSTM network, LSTM-RNN, long short-term memory RNN |
| Related | 4 | 4 |
| Summary ≠ | Self-supervised GRU trains a Gated Recurrent Unit network using automatically constructed supervision signals, such as next-step prediction or masked token recovery, derived from the unlabeled data itself. The learned sequence representations are then fine-tuned on small labeled datasets, making high-quality sequential modeling feasible when annotations are scarce. | Long Short-Term Memory (LSTM) is a gated recurrent neural network architecture introduced by Hochreiter and Schmidhuber in 1997. It was designed to learn dependencies across long sequences using dedicated memory cells and learned gates. In its now-standard form, three gates (forget, input, and output) control what information is retained, updated, or passed forward at each time step; the forget gate was added in follow-up work after the 1997 paper. |
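
To make the Summary row more concrete, the following is a minimal sketch of one time step of each cell, plus the way next-step prediction builds targets from unlabeled data. It assumes scalar inputs and states and hand-set weights. The function names (`lstm_step`, `gru_step`, `next_step_pairs`) and the parameter dictionary layout are illustrative only and do not come from either paper or from any library. The GRU update follows the convention in Cho et al. (2014), with bias terms added for symmetry with the LSTM sketch.

```python
import math


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x, h_prev, c_prev, params):
    """One scalar LSTM step (standard three-gate form).

    params maps a gate name to a hypothetical (w_x, w_h, b) triple.
    """
    def pre(name):
        w_x, w_h, b = params[name]
        return w_x * x + w_h * h_prev + b

    f = sigmoid(pre("forget"))        # fraction of the old cell state to retain
    i = sigmoid(pre("input"))         # fraction of the candidate to write
    o = sigmoid(pre("output"))        # fraction of the cell state to pass forward
    g = math.tanh(pre("candidate"))   # candidate cell content
    c = f * c_prev + i * g            # updated memory cell
    h = o * math.tanh(c)              # new hidden state
    return h, c


def gru_step(x, h_prev, params):
    """One scalar GRU step: two gates and no separate memory cell."""
    def pre(name, h):
        w_x, w_h, b = params[name]
        return w_x * x + w_h * h + b

    z = sigmoid(pre("update", h_prev))                 # update gate
    r = sigmoid(pre("reset", h_prev))                  # reset gate
    h_tilde = math.tanh(pre("candidate", r * h_prev))  # candidate state
    return z * h_prev + (1.0 - z) * h_tilde


def next_step_pairs(seq):
    """Self-supervised targets: each element is paired with its successor."""
    return list(zip(seq[:-1], seq[1:]))


if __name__ == "__main__":
    zeros = (0.0, 0.0, 0.0)
    lstm_params = {k: zeros for k in ("forget", "input", "output", "candidate")}
    gru_params = {k: zeros for k in ("update", "reset", "candidate")}
    print(lstm_step(1.0, 0.0, 2.0, lstm_params))
    print(gru_step(1.0, 0.8, gru_params))
    print(next_step_pairs([3, 1, 4, 1, 5]))
```

The LSTM carries two pieces of state (`h` and `c`) through three gates, while the GRU carries only `h` through two gates. In a self-supervised setup, the pairs from `next_step_pairs` would serve as (input, target) examples for pretraining before fine-tuning on labeled data.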