ScholarGate

Compare methods

Review the methods you selected side by side; rows that differ are highlighted.

| Attribute | Time-MoE | Mixture of Experts |
| --- | --- | --- |
| Field | Deep learning | Deep learning |
| Family | Machine learning | Machine learning |
| Year of origin | 2024 | 2017 |
| Originator | Xiaoming Shi et al. | Shazeer, N. et al. |
| Type | Sparse mixture-of-experts autoregressive foundation model | Sparse neural network architecture (conditional computation) |
| Alternative names | Time Mixture-of-Experts, Time-MoE Foundation Model, Sparse Time-Series Transformer, Zaman Karışık Uzmanlar Modeli | Uzman Karışımı (Mixture of Experts, MoE), MoE, sparse mixture of experts |
| Related | 3 | 3 |

Original source
Time-MoE: Shi, X., Wang, S., Nie, Y., Li, D., Ye, Z., Wen, Q., & Jin, M. (2024). Time-MoE: Billion-scale time series foundation models with mixture of experts. ICLR 2025.
Mixture of Experts: Shazeer, N. et al. (2017). Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer. ICLR. arXiv:1701.06538

Summary
Time-MoE: Time-MoE is a billion-scale autoregressive foundation model for universal time-series forecasting, introduced by Shi et al. in 2024 and accepted at ICLR 2025. It combines a decoder-only transformer architecture with sparse Mixture-of-Experts (MoE) feed-forward layers, so the model can scale to billions of parameters while activating only a small subset of expert networks per token. Capacity grows without a proportional increase in compute cost.
Mixture of Experts: Mixture of Experts (MoE) is a sparse neural-network architecture, introduced by Shazeer and colleagues in 2017 with the sparsely-gated MoE layer, in which only a subset of expert sub-networks is activated for each input. As seen in models such as Switch Transformer and Mixtral, it keeps the compute per input roughly constant even as the total parameter count grows.
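Both summaries rest on the same idea: a gate scores every expert, but only the few highest-scoring experts are actually evaluated for a given token. The sketch below illustrates that top-k routing in plain Python. It is a minimal illustration under stated assumptions, not the implementation of either paper: the toy experts (simple scalings), the names `top_k_gate`, `make_expert`, and `moe_forward`, and the sizes used are all hypothetical, and the noise term and load-balancing loss of the sparsely-gated layer from Shazeer et al. are left out.

```python
import math
import random


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_gate(logits, k):
    """Keep the k largest gate logits and softmax over just those.

    Returns {expert_index: weight}; experts outside the top k get no weight
    and are never evaluated.
    """
    ranked = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)
    chosen = ranked[:k]
    weights = softmax([logits[i] for i in chosen])
    return dict(zip(chosen, weights))


def make_expert(scale):
    """Hypothetical toy expert: scales its input (a real expert is an FFN)."""
    def expert(x):
        return [scale * xi for xi in x]
    return expert


def moe_forward(x, gate_matrix, experts, k):
    """Route one token x through its top-k experts and mix their outputs."""
    # One gate logit per expert: dot product of that expert's gate row with x.
    logits = [sum(w * xi for w, xi in zip(row, x)) for row in gate_matrix]
    routing = top_k_gate(logits, k)
    output = [0.0] * len(x)
    for index, weight in routing.items():
        expert_out = experts[index](x)  # only k experts ever run
        output = [o + weight * e for o, e in zip(output, expert_out)]
    return output, routing


# Assumed toy sizes: 8 experts in total, 2 active per token.
random.seed(0)
dim, num_experts, k = 4, 8, 2
experts = [make_expert(float(s)) for s in range(1, num_experts + 1)]
gate_matrix = [[random.gauss(0.0, 1.0) for _ in range(dim)]
               for _ in range(num_experts)]
token = [0.5, -1.0, 2.0, 0.1]
output, routing = moe_forward(token, gate_matrix, experts, k)
print("experts used:", sorted(routing), "of", num_experts)
```

Adding more experts increases the total parameter count, but the loop in `moe_forward` still runs only `k` of them per token, which is why per-token compute stays roughly flat as capacity grows.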
ScholarGate dataset
Time-MoE: v1, 1 source, PUBLISHED
Mixture of Experts: v1, 2 sources, PUBLISHED


ScholarGate. Compare methods: Time-MoE · Mixture of Experts. Retrieved 2026-06-19 from https://scholargate.app/sw/compare