Machine learningDeep learning / NLP / CV

Self-supervised Reinforcement Learning

Self-supervised Reinforcement Learning (SSL-RL) augments standard RL training with self-supervised auxiliary objectives — such as contrastive, predictive, or data-augmentation-based tasks — applied to the agent's own experience. These objectives improve the quality of learned representations without requiring extra human labels, enabling faster convergence and better sample efficiency, especially in high-dimensional observation spaces like raw pixels.

MethodMind'de açSoonVideoSoon

Tam yöntemi oku

Members only

Sign in with a free account to read this section.

Sign in

Sources

  1. Laskin, M., Srinivas, A., & Abbeel, P. (2020). CURL: Contrastive Unsupervised Representations for Reinforcement Learning. Proceedings of the 37th International Conference on Machine Learning (ICML), PMLR 119, 5639–5650. link
  2. Laskin, M., Lee, K., Stooke, A., Pinto, L., Abbeel, P., & Srinivas, A. (2021). Reinforcement Learning with Augmented Data. Advances in Neural Information Processing Systems (NeurIPS), 33, 19884–19895. link

Related methods

Referenced by

ScholarGateSelf-supervised Reinforcement Learning (Self-supervised Reinforcement Learning (SSL-augmented RL)). Retrieved 2026-06-04 from https://scholargate.app/tr/deep-learning/self-supervised-reinforcement-learning