ScholarGate
Asistent

Compară metode

Examinează metodele selectate una lângă alta; rândurile care diferă sunt evidențiate.

Învățare prin consolidare semi-supervizată×Învățare prin consolidare slab supervizată×
DomeniuÎnvățare profundăÎnvățare profundă
FamilieMachine learningMachine learning
Anul apariției2020s2010s–present
Autorul originalMultiple contributors (Laskin, Srinivas, Abbeel et al.)Multiple contributors; reward-learning framing: Christiano et al. (2017)
TipSemi-supervised training paradigm for RL agentsReinforcement learning with imperfect or partial reward supervision
Sursa seminalăZhan, X., Zhu, X., & Shi, H. (2022). Deepthermal: Combustion optimization for thermal power generating units using offline reinforcement learning. Proceedings of the AAAI Conference on Artificial Intelligence, 36(4), 4680–4688. link ↗Sutton, R. S. & Barto, A. G. (2018). Reinforcement Learning: An Introduction (2nd ed.). MIT Press. ISBN: 978-0-262-03924-6
Denumiri alternativeSSRL, semi-supervised RL, RL with unlabeled data, label-efficient reinforcement learningWSRL, weak-reward RL, imperfect-reward reinforcement learning, reward-impoverished RL
Înrudite63
RezumatSemi-supervised reinforcement learning (SSRL) combines standard reinforcement learning — where an agent learns from sparse reward signals — with semi-supervised techniques that extract structure from unlabeled environment interactions. The goal is to improve sample efficiency and generalization when reward feedback is costly, delayed, or available only for a fraction of the agent's experience.Weakly supervised reinforcement learning (WSRL) trains agents in environments where the reward signal is imperfect, sparse, delayed, or only partially informative — unlike dense fully-supervised RL. The agent must learn effective policies despite incomplete feedback, using auxiliary signals, reward modeling, or preference learning to compensate for the weak supervision.
ScholarGateSet de date
  1. v1
  2. 2 Surse
  3. PUBLISHED
  1. v1
  2. 2 Surse
  3. PUBLISHED

Mergi la căutare Descarcă prezentarea

ScholarGateCompară metode: Semi-supervised Reinforcement Learning · Weakly supervised reinforcement learning. Preluat la 2026-06-17 de pe https://scholargate.app/ro/compare