ScholarGate
Assistent
Machine learningDeep learning / NLP / CV

Selvovervåget forstærkningslæring

Selvovervåget forstærkningslæring (SSL-RL) udvider standard RL-træning med selvovervågede hjælpeobjektiver — såsom kontrastive, prædiktive eller dataforstærkningsbaserede opgaver — anvendt på agentens egen erfaring. Disse objektiver forbedrer kvaliteten af indlærte repræsentationer uden at kræve yderligere menneskelige mærkater, hvilket muliggør hurtigere konvergens og bedre prøveeffektivitet, især i højdimensionelle observationsrum som rå pixels.

Åbn i MethodMindSnartVideoSnartDownload slides

Læs hele metoden

Kun for medlemmer

Log ind med en gratis konto for at læse dette afsnit.

Log ind

Method map

The neighbourhood of related methods — select a node to explore.

Kilder

  1. Laskin, M., Srinivas, A., & Abbeel, P. (2020). CURL: Contrastive Unsupervised Representations for Reinforcement Learning. Proceedings of the 37th International Conference on Machine Learning (ICML), PMLR 119, 5639–5650. link
  2. Laskin, M., Lee, K., Stooke, A., Pinto, L., Abbeel, P., & Srinivas, A. (2021). Reinforcement Learning with Augmented Data. Advances in Neural Information Processing Systems (NeurIPS), 33, 19884–19895. link

Sådan citerer du denne side

ScholarGate. (2026, June 3). Self-supervised Reinforcement Learning (SSL-augmented RL). ScholarGate. https://scholargate.app/da/deep-learning/self-supervised-reinforcement-learning

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side

Refereret af

ScholarGateSelf-supervised Reinforcement Learning (Self-supervised Reinforcement Learning (SSL-augmented RL)). Hentet 2026-06-15 fra https://scholargate.app/da/deep-learning/self-supervised-reinforcement-learning · Datasæt: https://doi.org/10.5281/zenodo.20539026