विधियों की तुलना करें
चुनी हुई विधियों की आमने-सामने समीक्षा करें; भिन्नता वाली पंक्तियाँ रेखांकित हैं।
| सुधारक: लंबी अनुक्रमों के लिए कुशल ट्रांसफार्मर× | Pyraformer× | |
|---|---|---|
| क्षेत्र | गहन अधिगम | गहन अधिगम |
| परिवार | Machine learning | Machine learning |
| उद्भव वर्ष≠ | 2020 | 2022 |
| प्रवर्तक≠ | Nikita Kitaev, Łukasz Kaiser & Anselm Levskaya | Shizhan Liu et al. |
| प्रकार≠ | Memory-efficient attention-based sequence model | Pyramidal self-attention transformer for time-series forecasting |
| मौलिक स्रोत≠ | Kitaev, N., Kaiser, Ł., & Levskaya, A. (2020). Reformer: The efficient transformer. ICLR. link ↗ | Liu, S., Yu, H., Liao, C., Li, J., Lin, W., Liu, A. X., & Dustdar, S. (2022). Pyraformer: Low-complexity pyramidal attention for long-range time series modeling and forecasting. ICLR. link ↗ |
| उपनाम | Efficient Transformer, LSH Transformer, Locality-Sensitive Hashing Transformer, Verimli Dönüştürücü | Pyramidal Attention Transformer, Pyraformer Transformer, Piramit Dikkat Dönüştürücüsü, Low-Complexity Transformer |
| संबंधित≠ | 2 | 3 |
| सारांश≠ | The Reformer is an efficient variant of the Transformer architecture introduced by Kitaev, Kaiser, and Levskaya at ICLR 2020. It addresses the prohibitive O(L²) memory and computational cost of standard self-attention for long sequences. The key innovations are locality-sensitive hashing (LSH) attention, which approximates full attention in O(L log L) time, and reversible residual layers that dramatically reduce activation memory during training. | Pyraformer is a Transformer-based model for long-range time-series forecasting introduced by Liu et al. at ICLR 2022. Its central innovation is a Pyramidal Attention Module (PAM) that organizes tokens into a multi-resolution hierarchy, enabling the model to capture temporal dependencies across multiple scales while keeping time and memory complexity at O(L log L) rather than the quadratic cost of vanilla self-attention. |
| ScholarGateडेटासेट ↗ |
|
|