ScholarGate
助手
Process / pipelineSimulation / optimization

多目标动态规划——序贯决策中的帕累托最优策略

多目标动态规划(MODP)将经典的贝尔曼动态规划扩展到决策者需要在多个阶段同时优化几个相互竞争的目标的场景。它不产生单一的最优策略,而是产生一个帕累托最优策略集——每个策略代表一个独特的权衡剖面——通过将向量值函数向后传播通过状态空间。

在 MethodMind 中打开即将推出视频即将推出Download slides

阅读完整方法

仅限会员

使用免费账户登录即可阅读本节。

登录

Method map

The neighbourhood of related methods — select a node to explore.

来源

  1. Bellman, R. (1957). Dynamic Programming. Princeton University Press, Princeton, NJ. ISBN: 9780691079516
  2. Daellenbach, H. G., & Flood, R. L. (1992). Multi-objective dynamic programming. European Journal of Operational Research, 56(2), 215-225. link

如何引用本页

ScholarGate. (2026, June 3). Multi-Objective Dynamic Programming. ScholarGate. https://scholargate.app/zh/simulation/multi-objective-dynamic-programming

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side

被引用于

ScholarGateMulti-objective dynamic programming (Multi-Objective Dynamic Programming). 于 2026-06-15 检索自 https://scholargate.app/zh/simulation/multi-objective-dynamic-programming · 数据集: https://doi.org/10.5281/zenodo.20539026