Machine learningDeep learning / NLP / CV

自己教師あり強化学習

自己教師あり強化学習（SSL-RL）は、標準的なRLトレーニングに、エージェント自身の経験に適用される対照学習、予測、またはデータ拡張ベースのタスクなどの自己教師あり補助目的を追加します。これらの目的は、追加の人間のラベルを必要とせずに学習済み表現の質を向上させ、特に生のピクセルなどの高次元観測空間での収束の高速化とサンプル効率の向上を可能にします。

MethodMindで開く近日公開動画近日公開Download slides

手法の全文を読む

会員限定

無料アカウントでログインすると、このセクションを読めます。

ログイン

Method map

The neighbourhood of related methods — select a node to explore.

自己教師あり強化学習

強化学習自己教師あり畳み込みニューラルネットワーク Semi-supervised Reinforc…強化学習における転移学習ファインチューニング強化学習マルチモーダル強化学習弱教師あり強化学習

出典

Laskin, M., Srinivas, A., & Abbeel, P. (2020). CURL: Contrastive Unsupervised Representations for Reinforcement Learning. Proceedings of the 37th International Conference on Machine Learning (ICML), PMLR 119, 5639–5650. link ↗
Laskin, M., Lee, K., Stooke, A., Pinto, L., Abbeel, P., & Srinivas, A. (2021). Reinforcement Learning with Augmented Data. Advances in Neural Information Processing Systems (NeurIPS), 33, 19884–19895. link ↗

このページの引用方法

ScholarGate. (2026, June 3). Self-supervised Reinforcement Learning (SSL-augmented RL). ScholarGate. https://scholargate.app/ja/deep-learning/self-supervised-reinforcement-learning

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

強化学習深層学習↔ compare
自己教師あり畳み込みニューラルネットワーク深層学習↔ compare
Semi-supervised Reinforcement Learning深層学習↔ compare
強化学習における転移学習深層学習↔ compare

Compare side by side →

この手法を参照する項目

ファインチューニング強化学習マルチモーダル強化学習 Semi-supervised Reinforcement Learning 弱教師あり強化学習

このページに誤りを見つけましたか?報告・修正提案 →