Process / pipelineSimulation / optimization
Multi-objective Markov Model — Sequential Decision-Making Across Competing Objectives
A Multi-objective Markov Model (MOMDP) extends classical Markov Decision Processes to settings where an agent must optimize several reward signals simultaneously. Instead of a single optimal policy, the model produces a Pareto-optimal set of policies, enabling decision-makers to navigate trade-offs between competing goals such as cost, risk, and throughput over time.
Open in MethodMindSoonVideoSoon
Read the full method
Members only
Sign inSign in with a free account to read this section.
Sources
- Roijers, D. M., Vamplew, P., Whiteson, S., & Dazeley, R. (2013). A survey of multi-objective sequential decision-making. Journal of Artificial Intelligence Research, 48, 67–113. DOI: 10.1613/jair.3987 ↗
- Chatterjee, K., Majumdar, R., & Henzinger, T. A. (2006). Markov decision processes with multiple objectives. In Proceedings of STACS 2006, Lecture Notes in Computer Science, vol. 3884, pp. 325–336. Springer, Berlin. DOI: 10.1007/11672142_26 ↗