ScholarGate
دستیار

مقایسهٔ روش‌ها

روش‌های انتخابی خود را کنار هم مرور کنید؛ ردیف‌های متفاوت برجسته شده‌اند.

مدل مولد مبتنی بر امتیاز×یادگیری تقویتی عمیق×
حوزهیادگیری عمیقیادگیری عمیق
خانوادهMachine learningMachine learning
سال پیدایش20192015
پدیدآورSong, Y. & Ermon, S.Mnih, V. et al. (DQN)
نوعScore-based generative model (SDE framework)Sequential decision-making (agent–environment interaction)
منبع بنیادینSong, Y. & Ermon, S. (2019). Generative Modeling by Estimating Gradients of the Data Distribution. NeurIPS 32, 11895–11907. link ↗Mnih, V. et al. (2015). Human-Level Control through Deep Reinforcement Learning. Nature, 518, 529–533. DOI ↗
نام‌های دیگرSkor Tabanlı Üretici Model (Score-Based / SDE), score-based diffusion, SDE-based generative model, score SDEDerin Pekiştirmeli Öğrenme (DQN / PPO / A3C), derin pekiştirmeli öğrenme, deep RL, DRL
مرتبط54
خلاصهA score-based generative model, introduced by Yang Song and Stefano Ermon in 2019 and generalized to the stochastic differential equation (SDE) framework in 2021, learns the gradient of the data density — the score — rather than predicting noise directly, and uses it to generate new samples. It is the mathematical generalization that unifies diffusion models under a continuous-time formulation.Deep Reinforcement Learning combines neural networks with reinforcement learning so an agent learns by interacting with an environment, popularised by Mnih and colleagues' 2015 Nature work on human-level Atari control. Instead of learning from a fixed labelled dataset, the agent takes actions, observes rewards, and gradually shapes a policy that maximises long-run return.
ScholarGateمجموعه‌داده
  1. v1
  2. 2 منابع
  3. PUBLISHED
  1. v1
  2. 2 منابع
  3. PUBLISHED

رفتن به جست‌وجو دریافت اسلایدها

ScholarGateمقایسهٔ روش‌ها: Score-Based Generative Model · Deep Reinforcement Learning. بازیابی‌شده در 2026-06-15 از https://scholargate.app/fa/compare