
Multi-Objective Markov Model: Sequential Decision-Making Across Competing Objectives

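The multi-objective Markov model (MOMDP, multi-objective Markov decision process) extends the classical Markov decision process to settings in which an agent must optimize several reward signals at once. Instead of a single optimal policy, the model yields a set of Pareto-optimal policies, so that decision-makers can explore the trade-offs among competing objectives such as cost, risk, and throughput over time.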
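The idea of a Pareto-optimal policy set can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the source: a hypothetical toy problem with two states, two actions, and two objectives (both maximized), the transition table `P`, the reward table `R`, the discount factor `GAMMA`, and the restriction to deterministic stationary policies. Each policy is evaluated to a vector of discounted returns, and any policy whose vector is dominated by another is discarded.

```python
from itertools import product

# Hypothetical toy MOMDP (illustrative assumption, not from the source).
GAMMA = 0.9
STATES = [0, 1]
ACTIONS = [0, 1]
N_OBJ = 2

# P[s][a] = list of (next_state, probability)
P = {
    0: {0: [(0, 1.0)], 1: [(1, 1.0)]},
    1: {0: [(0, 1.0)], 1: [(1, 1.0)]},
}
# R[s][a] = reward vector, one entry per objective
R = {
    0: {0: (1.0, 0.0), 1: (0.0, 0.0)},
    1: {0: (0.0, 0.0), 1: (0.0, 1.0)},
}


def evaluate(policy, sweeps=500):
    """Iterative policy evaluation with vector-valued returns."""
    V = {s: [0.0] * N_OBJ for s in STATES}
    for _ in range(sweeps):
        V = {
            s: [
                R[s][policy[s]][k]
                + GAMMA * sum(p * V[s2][k] for s2, p in P[s][policy[s]])
                for k in range(N_OBJ)
            ]
            for s in STATES
        }
    return V


def dominates(u, v):
    """u dominates v: at least as good everywhere, strictly better somewhere."""
    return all(a >= b for a, b in zip(u, v)) and any(a > b for a, b in zip(u, v))


def pareto_policies(start=0):
    """Enumerate deterministic policies and keep the non-dominated ones."""
    values = {}
    for choice in product(ACTIONS, repeat=len(STATES)):
        policy = dict(zip(STATES, choice))
        values[choice] = tuple(evaluate(policy)[start])
    return {
        pi: v
        for pi, v in values.items()
        if not any(dominates(w, v) for w in values.values())
    }


front = pareto_policies()
for pi, v in sorted(front.items()):
    print(pi, v)
```

In this toy instance the policy that moves back and forth between the two states earns nothing on either objective, so it is dominated and dropped. The remaining policies trade the first objective against the second. Brute-force enumeration is only workable for very small problems; it is used here because it keeps the dominance check easy to follow.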


Related methods

Markov model, multi-objective dynamic programming, multi-objective optimization, stochastic dynamic programming, stochastic Markov model

Sources

Roijers, D. M., Vamplew, P., Whiteson, S., & Dazeley, R. (2013). A survey of multi-objective sequential decision-making. Journal of Artificial Intelligence Research, 48, 67–113. DOI: 10.1613/jair.3987
Chatterjee, K., Majumdar, R., & Henzinger, T. A. (2006). Markov decision processes with multiple objectives. In Proceedings of STACS 2006, Lecture Notes in Computer Science, vol. 3884, pp. 325–336. Springer, Berlin. DOI: 10.1007/11672142_26

How to cite this page

ScholarGate. (2026, June 3). Multi-objective Markov Decision Process Model. ScholarGate. https://scholargate.app/ko/simulation/multi-objective-markov-model
