ScholarGate
Assistant

Comparer des méthodes

Examinez les méthodes sélectionnées côte à côte ; les lignes qui diffèrent sont mises en évidence.

Apprentissage par renforcement semi-supervisé×Apprentissage par renforcement faiblement supervisé×
DomaineApprentissage profondApprentissage profond
FamilleMachine learningMachine learning
Année d'origine2020s2010s–present
Auteur d'origineMultiple contributors (Laskin, Srinivas, Abbeel et al.)Multiple contributors; reward-learning framing: Christiano et al. (2017)
TypeSemi-supervised training paradigm for RL agentsReinforcement learning with imperfect or partial reward supervision
Source fondatriceZhan, X., Zhu, X., & Shi, H. (2022). Deepthermal: Combustion optimization for thermal power generating units using offline reinforcement learning. Proceedings of the AAAI Conference on Artificial Intelligence, 36(4), 4680–4688. link ↗Sutton, R. S. & Barto, A. G. (2018). Reinforcement Learning: An Introduction (2nd ed.). MIT Press. ISBN: 978-0-262-03924-6
AliasSSRL, semi-supervised RL, RL with unlabeled data, label-efficient reinforcement learningWSRL, weak-reward RL, imperfect-reward reinforcement learning, reward-impoverished RL
Apparentées63
RésuméSemi-supervised reinforcement learning (SSRL) combines standard reinforcement learning — where an agent learns from sparse reward signals — with semi-supervised techniques that extract structure from unlabeled environment interactions. The goal is to improve sample efficiency and generalization when reward feedback is costly, delayed, or available only for a fraction of the agent's experience.Weakly supervised reinforcement learning (WSRL) trains agents in environments where the reward signal is imperfect, sparse, delayed, or only partially informative — unlike dense fully-supervised RL. The agent must learn effective policies despite incomplete feedback, using auxiliary signals, reward modeling, or preference learning to compensate for the weak supervision.
ScholarGateJeu de données
  1. v1
  2. 2 Sources
  3. PUBLISHED
  1. v1
  2. 2 Sources
  3. PUBLISHED

Aller à la recherche Télécharger les diapositives

ScholarGateComparer des méthodes: Semi-supervised Reinforcement Learning · Weakly supervised reinforcement learning. Consulté le 2026-06-17 sur https://scholargate.app/fr/compare