ScholarGate
Βοηθός

Σύγκριση μεθόδων

Εξετάστε τις επιλεγμένες μεθόδους δίπλα-δίπλα· οι γραμμές που διαφέρουν επισημαίνονται.

Time-MoE: Μοντέλο Θεμελίωσης Χρονοσειρών Μείγματος Ειδικών×Chronos: Ένα Tokenized Foundation Model για Πρόβλεψη Χρονοσειρών×Μείγμα Εμπειρογνωμόνων×
ΠεδίοΒαθιά ΜάθησηΒαθιά ΜάθησηΒαθιά Μάθηση
ΟικογένειαMachine learningMachine learningMachine learning
Έτος προέλευσης202420242017
ΔημιουργόςXiaoming Shi et al.Abdul Fatir Ansari et al. (Amazon)Shazeer, N. et al.
ΤύποςSparse mixture-of-experts autoregressive foundation modelPre-trained language-model-based time-series forecasterSparse neural network architecture (conditional computation)
Θεμελιώδης πηγήShi, X., Wang, S., Nie, Y., Li, D., Ye, Z., Wen, Q., & Jin, M. (2024). Time-MoE: Billion-scale time series foundation models with mixture of experts. ICLR 2025. link ↗Ansari, A. F., Stella, L., Turkmen, C., Zhang, X., Mercado, P., Shen, H., et al. (2024). Chronos: Learning the language of time series. Transactions on Machine Learning Research. link ↗Shazeer, N. et al. (2017). Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer. ICLR. arXiv:1701.06538 link ↗
Εναλλακτικές ονομασίεςTime Mixture-of-Experts, Time-MoE Foundation Model, Sparse Time-Series Transformer, Zaman Karışık Uzmanlar ModeliChronos Forecasting Model, Amazon Chronos, Tokenized Time-Series LLM, Kronos Zaman Serisi ModeliUzman Karışımı (Mixture of Experts — MoE), uzman karışımı, MoE, sparse mixture of experts
Συναφείς323
ΣύνοψηTime-MoE is a billion-scale autoregressive foundation model for universal time-series forecasting, introduced by Shi et al. in 2024 and accepted at ICLR 2025. It combines a decoder-only transformer architecture with sparse Mixture-of-Experts (MoE) feed-forward layers, enabling the model to scale to billions of parameters while activating only a small subset of expert networks per token—dramatically increasing capacity without proportional compute cost.Chronos is a family of pre-trained probabilistic forecasting models introduced by Ansari et al. at Amazon in 2024. It adapts the language-model paradigm to time series by quantizing continuous values into discrete tokens, enabling a standard transformer to be trained on a large heterogeneous corpus of time-series data. The result is a zero-shot forecasting model that generalizes across domains without requiring dataset-specific retraining.Mixture of Experts (MoE) is a sparse neural-network architecture, introduced by Shazeer and colleagues in 2017 with the sparsely-gated MoE layer, in which only a subset of expert sub-networks is activated for each input. As seen in models such as Switch Transformer and Mixtral, it holds computation cost fixed even as the total parameter count grows.
ScholarGateΣύνολο δεδομένων
  1. v1
  2. 1 Πηγές
  3. PUBLISHED
  1. v1
  2. 1 Πηγές
  3. PUBLISHED
  1. v1
  2. 2 Πηγές
  3. PUBLISHED

Μετάβαση στην αναζήτηση Λήψη διαφανειών

ScholarGateΣύγκριση μεθόδων: Time-MoE · Chronos · Mixture of Experts. Ανακτήθηκε στις 2026-06-20 από https://scholargate.app/el/compare