ScholarGate
Msaidizi

Linganisha mbinu

Pitia mbinu ulizochagua bega kwa bega; safu zinazotofautiana zinaangaziwa.

Msisimko wa usaidizi wa kujifunza (Semi-supervised Reinforcement Learning)×Jifunze la Uimarishaji la Usimamizi dhaifu×
NyanjaUjifunzaji wa KinaUjifunzaji wa Kina
FamiliaMachine learningMachine learning
Mwaka wa asili2020s2010s–present
MwanzilishiMultiple contributors (Laskin, Srinivas, Abbeel et al.)Multiple contributors; reward-learning framing: Christiano et al. (2017)
AinaSemi-supervised training paradigm for RL agentsReinforcement learning with imperfect or partial reward supervision
Chanzo asiliaZhan, X., Zhu, X., & Shi, H. (2022). Deepthermal: Combustion optimization for thermal power generating units using offline reinforcement learning. Proceedings of the AAAI Conference on Artificial Intelligence, 36(4), 4680–4688. link ↗Sutton, R. S. & Barto, A. G. (2018). Reinforcement Learning: An Introduction (2nd ed.). MIT Press. ISBN: 978-0-262-03924-6
Majina mbadalaSSRL, semi-supervised RL, RL with unlabeled data, label-efficient reinforcement learningWSRL, weak-reward RL, imperfect-reward reinforcement learning, reward-impoverished RL
Zinazohusiana63
MuhtasariSemi-supervised reinforcement learning (SSRL) combines standard reinforcement learning — where an agent learns from sparse reward signals — with semi-supervised techniques that extract structure from unlabeled environment interactions. The goal is to improve sample efficiency and generalization when reward feedback is costly, delayed, or available only for a fraction of the agent's experience.Weakly supervised reinforcement learning (WSRL) trains agents in environments where the reward signal is imperfect, sparse, delayed, or only partially informative — unlike dense fully-supervised RL. The agent must learn effective policies despite incomplete feedback, using auxiliary signals, reward modeling, or preference learning to compensate for the weak supervision.
ScholarGateSeti ya data
  1. v1
  2. 2 Vyanzo
  3. PUBLISHED
  1. v1
  2. 2 Vyanzo
  3. PUBLISHED

Nenda kwenye utafutaji Pakua slaidi

ScholarGateLinganisha mbinu: Semi-supervised Reinforcement Learning · Weakly supervised reinforcement learning. Imepatikana 2026-06-17 kutoka https://scholargate.app/sw/compare