Machine learningDeep learning / NLP / CV

Multimodal LSTM

Multimodal LSTM เป็นเครือข่าย Long Short-Term Memory (LSTM) แบบมาตรฐานที่ได้รับการพัฒนาขึ้นเพื่อประมวลผลข้อมูลลำดับเวลาจากหลายรูปแบบ (modalities) พร้อมกัน เช่น ข้อความ เสียง และวิดีโอ ภายในสถาปัตยกรรมแบบเวียนซ้ำ (recurrent architecture) ที่รวมเป็นหนึ่งเดียว ด้วยการหลอมรวม (fusing) การนำเสนอข้อมูล (representations) จากแหล่งต่างๆ ก่อนหรือภายในเซลล์ LSTM ทำให้สามารถจับความสัมพันธ์เชิงเวลา (temporal dependencies) ที่ครอบคลุมและเชื่อมโยงข้ามรูปแบบข้อมูลได้ ซึ่งทำให้เป็นแนวทางพื้นฐานสำหรับงานต่างๆ เช่น การวิเคราะห์ความรู้สึก (sentiment analysis) การสร้างคำบรรยายวิดีโอ (video captioning) และการประมวลผลทางอารมณ์ (affective computing).

เปิดใน MethodMindเร็ว ๆ นี้วิดีโอเร็ว ๆ นี้Download slides

อ่านวิธีฉบับเต็ม

สำหรับสมาชิกเท่านั้น

เข้าสู่ระบบด้วยบัญชีฟรีเพื่ออ่านส่วนนี้

เข้าสู่ระบบ

Method map

The neighbourhood of related methods — select a node to explore.

Multimodal LSTM

กลไกการใส่ใจ (Attention…หน่วยความจำแบบวนซ้ำแบบมี…แอลเอสทีเอ็ม Multimodal Transformer GRU หลายรูปแบบ

แหล่งอ้างอิง

Rajagopalan, S., Tran, L., Rozgic, V., Narayanan, S., Kumar, A., & Ramakrishna, S. (2016). Extending Long Short-Term Memory for Multi-View Structured Learning. In Proceedings of ECCV 2016. Springer. link ↗
Hochreiter, S., & Schmidhuber, J. (1997). Long Short-Term Memory. Neural Computation, 9(8), 1735–1780. DOI: 10.1162/neco.1997.9.8.1735 ↗

วิธีอ้างอิงหน้านี้

ScholarGate. (2026, June 3). Multimodal Long Short-Term Memory Network. ScholarGate. https://scholargate.app/th/deep-learning/multimodal-lstm

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

กลไกการใส่ใจ (Attention Mechanism)การเรียนรู้เชิงลึก↔ compare
หน่วยความจำแบบวนซ้ำแบบมีประตู (Gated Recurrent Unit - GRU)การเรียนรู้เชิงลึก↔ compare
แอลเอสทีเอ็มการเรียนรู้เชิงลึก↔ compare
Multimodal Transformerการเรียนรู้เชิงลึก↔ compare

Compare side by side →

ถูกอ้างอิงโดย

GRU หลายรูปแบบ

พบปัญหาในหน้านี้หรือไม่ แจ้งหรือเสนอการแก้ไข →