ScholarGate

Method comparison

Review the selected methods side by side. Rows that differ are highlighted.

| | Multimodal GRU | Multimodal LSTM |
|---|---|---|
| Field | Deep learning | Deep learning |
| Family | Machine learning | Machine learning |
| Year of origin | 2014–2017 | 2016 |
| Originators | Cho, K. et al. (GRU); adapted to multimodal settings by multiple research groups | Rajagopalan et al. and various concurrent works (2016–2018) |
| Type | Recurrent neural network (multimodal variant) | Recurrent neural network architecture |
| Original source | Cho, K., van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., & Bengio, Y. (2014). Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. Proceedings of EMNLP 2014, 1724–1734. | Rajagopalan, S. S., Morency, L.-P., Baltrušaitis, T., & Goecke, R. (2016). Extending Long Short-Term Memory for Multi-View Structured Learning. In Proceedings of ECCV 2016. Springer. |
| Aliases | MM-GRU, Multimodal Gated Recurrent Unit, Cross-modal GRU, Multi-input GRU | MM-LSTM, multimodal recurrent network, multi-input LSTM, multimodal sequence model |
| Related | 6 | 4 |
| Summary | Multimodal GRU extends the Gated Recurrent Unit architecture to jointly process sequential data from multiple input modalities (such as text, audio, and video frames) within a single recurrent framework. By fusing modality-specific encodings at the input or hidden-state level, it captures temporal dependencies across heterogeneous data streams. It is used in multimodal sentiment analysis, video understanding, and audio-visual speech recognition. | Multimodal LSTM extends the standard Long Short-Term Memory network to jointly process sequential data from multiple input modalities (such as text, audio, and video) within a unified recurrent architecture. By fusing representations from different sources before or within the LSTM cells, it captures temporal dependencies that span modalities, making it a foundational approach for tasks like sentiment analysis, video captioning, and affective computing. |
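
The input-level fusion mentioned in the Multimodal GRU summary can be illustrated with a small sketch. This is not code from either cited paper: the class name `EarlyFusionGRU`, its parameters, and the random untrained weights are assumptions made for illustration. It concatenates the per-modality feature vectors at each time step and passes the fused vector through a GRU update written in the gate formulation of Cho et al. (2014), using only the Python standard library.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, vector)) for row in matrix]


def random_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class EarlyFusionGRU:
    """Hypothetical early-fusion multimodal GRU with random, untrained weights."""

    def __init__(self, modality_dims, hidden_dim, seed=0):
        rng = random.Random(seed)
        self.modality_dims = list(modality_dims)
        self.hidden_dim = hidden_dim
        input_dim = sum(self.modality_dims)
        # One (input weights, recurrent weights, bias) triple per gate.
        self.params = {
            gate: (
                random_matrix(hidden_dim, input_dim, rng),
                random_matrix(hidden_dim, hidden_dim, rng),
                [0.0] * hidden_dim,
            )
            for gate in ("update", "reset", "candidate")
        }

    def _affine(self, gate, x, h):
        w, u, b = self.params[gate]
        return [a + c + d for a, c, d in zip(matvec(w, x), matvec(u, h), b)]

    def step(self, features, h):
        """Advance one time step; `features` holds one vector per modality."""
        if len(features) != len(self.modality_dims):
            raise ValueError("expected one feature vector per modality")
        for feat, dim in zip(features, self.modality_dims):
            if len(feat) != dim:
                raise ValueError("feature vector has the wrong dimension")
        # Early fusion: concatenate all modality vectors into one input.
        x = [value for feat in features for value in feat]
        z = [sigmoid(v) for v in self._affine("update", x, h)]
        r = [sigmoid(v) for v in self._affine("reset", x, h)]
        gated_h = [ri * hi for ri, hi in zip(r, h)]
        n = [math.tanh(v) for v in self._affine("candidate", x, gated_h)]
        # New state interpolates between the old state and the candidate.
        return [zi * hi + (1.0 - zi) * ni for zi, hi, ni in zip(z, h, n)]

    def run(self, sequence):
        """Process a list of time steps and return the final hidden state."""
        h = [0.0] * self.hidden_dim
        for features in sequence:
            h = self.step(features, h)
        return h


# Two time steps with a 3-dimensional and a 2-dimensional modality.
model = EarlyFusionGRU(modality_dims=[3, 2], hidden_dim=4)
sequence = [
    ([0.1, 0.2, 0.3], [0.5, -0.5]),
    ([0.0, -0.1, 0.4], [0.2, 0.1]),
]
final_state = model.run(sequence)
print(final_state)
```

Hidden-state-level fusion, the other option the summary names, would instead keep one recurrent cell per modality and combine their states. A Multimodal LSTM variant of this sketch would swap the GRU update for LSTM gates and a separate cell state.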
ScholarGate dataset: both entries are at v1, with 2 sources each, status PUBLISHED.


ScholarGate method comparison: Multimodal GRU · Multimodal LSTM. Retrieved 2026-06-18 from https://scholargate.app/ko/compare