Machine learningDeep learning / NLP / CV
Explainable Reinforcement Learning
Explainable Reinforcement Learning (XRL) augments standard reinforcement learning agents with methods that make their policies, decisions, and learned behaviors interpretable to humans. Rather than treating the policy as a black box, XRL produces post-hoc explanations or builds inherently transparent policies, enabling trust verification, debugging, and accountability in high-stakes automated decision-making.
Open in MethodMindSoonVideoSoon
Read the full method
Members only
Sign inSign in with a free account to read this section.
Sources
- Puiutta, E., & Veith, E. M. S. P. (2020). Explainable Reinforcement Learning: A Survey. In Machine Learning and Knowledge Extraction (CD-MAKE 2020), Lecture Notes in Computer Science, vol. 12279, pp. 77–95. Springer. DOI: 10.1007/978-3-030-57321-8_5 ↗
- Explainable artificial intelligence. Wikipedia. link ↗