ScholarGate
助手
Machine learningDeep learning / NLP / CV

自监督强化学习

自监督强化学习(SSL-RL)通过对智能体自身经验应用自监督辅助目标(例如对比、预测或基于数据增强的任务),来增强标准强化学习训练。这些目标无需额外的人工标签即可提高学习表征的质量,从而实现更快的收敛和更好的样本效率,尤其是在原始像素等高维观测空间中。

在 MethodMind 中打开即将推出视频即将推出Download slides

阅读完整方法

仅限会员

使用免费账户登录即可阅读本节。

登录

Method map

The neighbourhood of related methods — select a node to explore.

来源

  1. Laskin, M., Srinivas, A., & Abbeel, P. (2020). CURL: Contrastive Unsupervised Representations for Reinforcement Learning. Proceedings of the 37th International Conference on Machine Learning (ICML), PMLR 119, 5639–5650. link
  2. Laskin, M., Lee, K., Stooke, A., Pinto, L., Abbeel, P., & Srinivas, A. (2021). Reinforcement Learning with Augmented Data. Advances in Neural Information Processing Systems (NeurIPS), 33, 19884–19895. link

如何引用本页

ScholarGate. (2026, June 3). Self-supervised Reinforcement Learning (SSL-augmented RL). ScholarGate. https://scholargate.app/zh/deep-learning/self-supervised-reinforcement-learning

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side

被引用于

ScholarGateSelf-supervised Reinforcement Learning (Self-supervised Reinforcement Learning (SSL-augmented RL)). 于 2026-06-15 检索自 https://scholargate.app/zh/deep-learning/self-supervised-reinforcement-learning · 数据集: https://doi.org/10.5281/zenodo.20539026