ScholarGate
Асистент

Порівняння методів

Переглядайте обрані методи поруч; рядки з відмінностями підсвічено.

LSTM×Згорткова нейронна мережа (класифікація)×Трансформер (НЛП)×
ГалузьГлибоке навчанняГлибоке навчанняГлибоке навчання
РодинаMachine learningMachine learningMachine learning
Рік появи199719982017
Автор методуHochreiter, S. & Schmidhuber, J.LeCun, Y. et al.Vaswani, A. et al.
ТипRecurrent neural network (gated memory cell)Deep neural network (convolutional)Attention-based deep neural network
Основоположне джерелоHochreiter, S. & Schmidhuber, J. (1997). Long Short-Term Memory. Neural Computation, 9(8), 1735–1780. DOI ↗LeCun, Y., Bottou, L., Bengio, Y. & Haffner, P. (1998). Gradient-Based Learning Applied to Document Recognition. Proceedings of the IEEE, 86(11), 2278–2324. DOI ↗Vaswani, A. et al. (2017). Attention Is All You Need. NeurIPS. link ↗
Інші назвиLSTM (Uzun Kısa Dönem Bellek Ağı), long short-term memory, LSTM network, recurrent neural network with memory cellsCNN (Evrişimli Sinir Ağı — Sınıflandırma), CNN classification, ConvNet, convolutional network classifierTransformer Modeli (NLP), attention-based language model, self-attention network, transformer NLP
Пов'язані554
ПідсумокLSTM (Long Short-Term Memory) is a recurrent neural network architecture, introduced by Sepp Hochreiter and Jürgen Schmidhuber in 1997, that can learn long-term dependencies in sequential data and is widely used for time-series and sequence prediction. It keeps an internal memory that lets information persist across many time steps.A Convolutional Neural Network (CNN) is a deep learning model, established by LeCun and colleagues in 1998, that learns local patterns directly from images and structured data to classify them. Stacks of convolutional filters discover increasingly abstract features, so manual feature engineering can be largely reduced.The Transformer is an attention-based deep learning model, introduced by Vaswani and colleagues in 2017, that performs text classification, named-entity recognition, and language modelling by letting every token in a sequence attend directly to every other token. It replaced earlier recurrent designs with a self-attention mechanism that processes whole sequences in parallel.
ScholarGateНабір даних
  1. v1
  2. 1 Джерела
  3. PUBLISHED
  1. v1
  2. 1 Джерела
  3. PUBLISHED
  1. v1
  2. 1 Джерела
  3. PUBLISHED

Перейти до пошуку Завантажити слайди

ScholarGateПорівняння методів: LSTM · Convolutional Neural Network · Transformer. Отримано 2026-06-18 з https://scholargate.app/uk/compare