ScholarGate

Compare methods

Browse the selected methods side by side; rows that differ are highlighted.

| | Reformer: The Efficient Transformer for Long Sequences | Informer | Pyraformer |
| --- | --- | --- | --- |
| Field | Deep learning | Deep learning | Deep learning |
| Family | Machine learning | Machine learning | Machine learning |
| Year introduced | 2020 | 2021 | 2022 |
| Creator | Nikita Kitaev, Łukasz Kaiser & Anselm Levskaya | Zhou, H. et al. | Shizhan Liu et al. |
| Type | Memory-efficient attention-based sequence model | Transformer (ProbSparse self-attention) | Pyramidal self-attention transformer for time-series forecasting |
| Primary source | Kitaev, N., Kaiser, Ł., & Levskaya, A. (2020). Reformer: The efficient transformer. ICLR. | Zhou, H. et al. (2021). Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting. AAAI. | Liu, S., Yu, H., Liao, C., Li, J., Lin, W., Liu, A. X., & Dustdar, S. (2022). Pyraformer: Low-complexity pyramidal attention for long-range time series modeling and forecasting. ICLR. |
| Other names | Efficient Transformer, LSH Transformer, Locality-Sensitive Hashing Transformer, Verimli Dönüştürücü | Informer — Uzun Dizi Transformer Tahmini, Informer transformer, ProbSparse attention forecaster | Pyramidal Attention Transformer, Pyraformer Transformer, Piramit Dikkat Dönüştürücüsü, Low-Complexity Transformer |
| Related | 2 | 5 | 3 |
Summary

Reformer: The Reformer is an efficient variant of the Transformer architecture introduced by Kitaev, Kaiser, and Levskaya at ICLR 2020. It addresses the O(L²) memory and computational cost of standard self-attention, which becomes prohibitive for long sequences. Its two key innovations are locality-sensitive hashing (LSH) attention, which approximates full attention in O(L log L) time, and reversible residual layers, which let activations be recomputed during the backward pass instead of stored, so activation memory no longer grows with the number of layers.

Informer: Informer is a Transformer-based model introduced by Zhou et al. in 2021 for long-sequence time-series forecasting. Its ProbSparse self-attention mechanism lowers the computational complexity of standard self-attention from O(L²) to O(L log L). It is built for problems that demand predictions across thousands of future steps.

Pyraformer: Pyraformer is a Transformer-based model for long-range time-series forecasting introduced by Liu et al. at ICLR 2022. Its central innovation is a Pyramidal Attention Module (PAM) that organizes tokens into a multi-resolution hierarchy, so the model captures temporal dependencies across multiple scales while keeping time and memory complexity linear in the sequence length, O(L), rather than the quadratic cost of vanilla self-attention.
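The two Reformer ideas named in the summary can be illustrated with a small pure-Python sketch. It is not the authors' implementation: `f_sublayer` and `g_sublayer` are hypothetical stand-ins for the attention and feed-forward sublayers, and `make_projections` and `lsh_bucket` are illustrative names. The reversible block uses the usual coupling y1 = x1 + F(x2), y2 = x2 + G(y1), and the hash is the random-projection argmax scheme commonly used for angular LSH.

```python
import math
import random


def f_sublayer(x):
    """Hypothetical stand-in for the attention sublayer F."""
    return [math.tanh(v) for v in x]


def g_sublayer(x):
    """Hypothetical stand-in for the feed-forward sublayer G."""
    return [0.5 * v * v for v in x]


def rev_forward(x1, x2):
    """Reversible block: y1 = x1 + F(x2), y2 = x2 + G(y1)."""
    y1 = [a + b for a, b in zip(x1, f_sublayer(x2))]
    y2 = [a + b for a, b in zip(x2, g_sublayer(y1))]
    return y1, y2


def rev_inverse(y1, y2):
    """Recover the inputs from the outputs, so they need not be stored."""
    x2 = [a - b for a, b in zip(y2, g_sublayer(y1))]
    x1 = [a - b for a, b in zip(y1, f_sublayer(x2))]
    return x1, x2


def make_projections(dim, n_buckets, seed=0):
    """Random Gaussian projection rows (assumes n_buckets is even)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)]
            for _ in range(n_buckets // 2)]


def lsh_bucket(vec, projections):
    """Angular LSH: argmax over the projections and their negations."""
    scores = [sum(v * p for v, p in zip(vec, row)) for row in projections]
    scores = scores + [-s for s in scores]
    return max(range(len(scores)), key=scores.__getitem__)


if __name__ == "__main__":
    x1, x2 = [0.1, -0.4, 0.7], [1.2, 0.0, -0.3]
    y1, y2 = rev_forward(x1, x2)
    r1, r2 = rev_inverse(y1, y2)
    # Largest reconstruction error across both halves of the input.
    print(max(abs(a - b) for a, b in zip(x1 + x2, r1 + r2)))

    proj = make_projections(dim=3, n_buckets=8)
    # A vector and a positively scaled copy point the same way,
    # so they fall into the same bucket.
    print(lsh_bucket(x1, proj), lsh_bucket([2 * v for v in x1], proj))
```

Because the bucket depends only on a vector's direction, queries and keys that point in similar directions tend to share a bucket, and attention can then be restricted to each bucket instead of all L positions.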
ScholarGate dataset

| | Reformer | Informer | Pyraformer |
| --- | --- | --- | --- |
| Version | v1 | v1 | v1 |
| Sources | 1 | 2 | 1 |
| Status | PUBLISHED | PUBLISHED | PUBLISHED |


ScholarGate, Compare methods: Reformer · Informer · Pyraformer. Retrieved 2026-06-18 from https://scholargate.app/pl/compare