विधियों की तुलना करें
चुनी हुई विधियों की आमने-सामने समीक्षा करें; भिन्नता वाली पंक्तियाँ रेखांकित हैं।
| विभेदक विकास× | डीप रीइन्फोर्समेंट लर्निंग× | |
|---|---|---|
| क्षेत्र≠ | अनुकूलन | गहन अधिगम |
| परिवार≠ | Process / pipeline | Machine learning |
| उद्भव वर्ष≠ | 1997 | 2015 |
| प्रवर्तक≠ | Rainer Storn & Kenneth Price | Mnih, V. et al. (DQN) |
| प्रकार≠ | Population-based stochastic metaheuristic | Sequential decision-making (agent–environment interaction) |
| मौलिक स्रोत≠ | Storn, R. & Price, K. (1997). Differential Evolution – A Simple and Efficient Heuristic for Global Optimization over Continuous Spaces. Journal of Global Optimization, 11(4), 341–359. DOI ↗ | Mnih, V. et al. (2015). Human-Level Control through Deep Reinforcement Learning. Nature, 518, 529–533. DOI ↗ |
| उपनाम≠ | DE algorithm, Diferansiyel Evrim (DE), DE optimization | Derin Pekiştirmeli Öğrenme (DQN / PPO / A3C), derin pekiştirmeli öğrenme, deep RL, DRL |
| संबंधित≠ | 5 | 4 |
| सारांश≠ | Differential Evolution (DE), introduced by Rainer Storn and Kenneth Price in 1997, is a population-based stochastic optimisation algorithm designed for continuous parameter spaces. It generates candidate solutions by combining vector differences between existing population members, making it a powerful and parameter-lean alternative to Genetic Algorithms and Particle Swarm Optimisation when the search landscape is non-convex, multimodal, or poorly suited to gradient-based methods. | Deep Reinforcement Learning combines neural networks with reinforcement learning so an agent learns by interacting with an environment, popularised by Mnih and colleagues' 2015 Nature work on human-level Atari control. Instead of learning from a fixed labelled dataset, the agent takes actions, observes rewards, and gradually shapes a policy that maximises long-run return. |
| ScholarGateडेटासेट ↗ |
|
|