Scholar
Gate
Msaidizi
Nyanja zote
▾
SW ▾
Kuhusu
Reference
Swali na Muundo
Sampuli na Upimaji
Uchanganuzi
Usababishi na Ushahidi
Kuripoti na Maadili
Mwanzo
/
Mwandishi
Christiano, P. et al.; Ouyang, L. et al.
Mbinu zinazohusishwa na mwandishi huyu.
Mbinu 1
Ujifunzaji wa Kina
1
Fine-Tuned Reinforcement Learning
2017