Machine learningDeep learning / NLP / CV
迁移学习与强化学习 (Transfer RL) 是一种训练范式,其中代理在一个或多个源任务中获得的知识——编码为策略权重、价值函数或学习到的表示——被重新用于加速或改进相关但不同的目标任务的学习。它直接解决了从头开始在复杂或昂贵的环境中进行强化学习所带来的样本效率低下的问题。
强化学习需要代理探索环境、接收奖励,并通过许多回合来调整其行为——这是一个既缓慢又昂贵的过程。迁移 RL 通过让代理获得一个“先发优势”来缩短这个过程:它不是从零开始学习,而是从一个在类似任务中已有经验的策略或价值函数开始。代理仍然需要适应,但表示学习的艰巨工作已经部分完成,这就像一位已经理解了战术的国际象棋棋手在学习新的开局一样。
阅读完整方法
仅限会员
登录使用免费账户登录即可阅读本节。
Method map
The neighbourhood of related methods — select a node to explore.
来源
- Taylor, M. E., & Stone, P. (2009). Transfer Learning for Reinforcement Learning Domains: A Survey. Journal of Machine Learning Research, 10, 1633–1685. link ↗
- Lazaric, A. (2012). Transfer in Reinforcement Learning: A Framework and a Survey. In M. Wiering & M. van Otterlo (Eds.), Reinforcement Learning: State-of-the-Art (pp. 143–173). Springer. link ↗
如何引用本页
ScholarGate. (2026, June 3). Transfer Learning Applied to Reinforcement Learning. ScholarGate. https://scholargate.app/zh/deep-learning/transfer-learning-reinforcement-learning
Which method?
Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.
Compare side by side →