Jifunze kwa Kuimarisha (Reinforcement Learning)
Jifunze kwa Kuimarisha (RL) ni mfumo ambapo ajenti hujifunza kufanya maamuzi mfululizo kwa kuingiliana na mazingira, kupokea mawimbi ya tuzo ya scalar, na kusasisha sera ili kuongeza tuzo ya baadaye kwa jumla. Tofauti na kujifunza kwa usimamizi, hakuna mifano yenye lebo inayotolewa; ajenti hugundua tabia bora kupitia uzoefu na maoni yaliyocheleweshwa.
Soma mbinu kamili
Ingia kwa akaunti ya bure ili kusoma sehemu hii.
Method map
The neighbourhood of related methods — select a node to explore.
+2 more
Vyanzo
- Sutton, R. S. & Barto, A. G. (2018). Reinforcement Learning: An Introduction (2nd ed.). MIT Press. ISBN: 978-0-262-03924-6
- Mnih, V., Kavukcuoglu, K., Silver, D., et al. (2015). Human-level control through deep reinforcement learning. Nature, 518, 529–533. DOI: 10.1038/nature14236 ↗
Jinsi ya kunukuu ukurasa huu
ScholarGate. (2026, June 3). Reinforcement Learning (Agent-Environment Reward Optimization). ScholarGate. https://scholargate.app/sw/deep-learning/reinforcement-learning
Which method?
Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.
- Mbinu za Kielelezo cha SeraUjifunzaji wa Mashine↔ compare
- Mtandao wa Nyuro UnaojirudiaUjifunzaji wa Kina↔ compare
Imerejelewa na
Umeona tatizo kwenye ukurasa huu? Ripoti au pendekeza marekebisho →