方法对比
并排查看您选择的方法;存在差异的行会高亮显示。
| 基于智能体的动态规划× | 随机动态规划× | |
|---|---|---|
| 领域 | 仿真 | 仿真 |
| 方法族 | Process / pipeline | Process / pipeline |
| 起源年份≠ | 1957 (DP); 1990s onward (ABM integration) | 1957 |
| 提出者≠ | Bellman, R. (DP foundation); Tesfatsion, L. et al. (ABM-DP integration) | Bellman, R.; formalized for stochastic settings by Puterman, M. L. |
| 类型≠ | Hybrid simulation-optimization | Sequential optimization under uncertainty |
| 开创性文献 | Bellman, R. (1957). Dynamic Programming. Princeton University Press, Princeton, NJ. ISBN: 9780691079516 | Bellman, R. (1957). Dynamic Programming. Princeton University Press, Princeton, NJ. ISBN: 9780486428093 |
| 别名 | ABDP, Agent-based DP, Multi-agent dynamic programming, ABM-DP | SDP, Markov Decision Process, MDP, Stochastic DP |
| 相关≠ | 5 | 6 |
| 摘要≠ | Agent-based dynamic programming (ABDP) embeds Bellman's dynamic programming framework within individual agents of an agent-based model, enabling each agent to solve sequential, multi-stage decision problems using backward induction or value-function iteration. The result is a population of optimizing agents whose interactions generate emergent system-level behavior. | Stochastic Dynamic Programming (SDP) is a mathematical optimization framework for sequential decision problems where outcomes are partly random. It extends Bellman's principle of optimality to stochastic environments, representing problems as Markov Decision Processes (MDPs) and computing optimal policies by solving recursive value equations over states and time periods. |
| ScholarGate数据集 ↗ |
|
|