散文網(wǎng) » 科技 »學(xué)習(xí) » Reinforcement Learning_Code_Policy Gradient

Reinforcement Learning_Code_Policy Gradient

2023-04-10 23:35 作者:別叫我小紅 0人讀過 | 我要投稿

Following results and code are the implementation of policy gradient, including REINFORCE, in Gymnasium's Cart Pole environment.

RESULTS:

Visualizations of (i) changes in scores and?losses, and (ii) animation results.

Since REINFROCE makes use of?Monte Carlo estimation, its convergence rate is slow and it does?not converge after 10 thousand steps.

However, it has got a not too bad result and is hopefully to achieve more than 200 points if?more steps are given.

CODE:

NetWork.py

REINFORCEAgent.py

train_and_test.py

The above code are mainly based on Chapter 9 of?Hands-on Reinforcement Learning [1] and my previous implementation of value function apporximation with Mente Carlo [2].

Reference

[1]?https://hrl.boyuai.com/

[2]?https://www.bilibili.com/read/cv22924612

標(biāo)簽：強化學(xué)習(xí)

Reinforcement Learning_Code_Policy Gradient的評論 (共條)

愛情散文傷感散文哲理散文優(yōu)美生活隨筆親情唯美句子傷感的句子現(xiàn)代詩歌空間日志經(jīng)典語句愛情句子作文大全

五月天青色头像情侣网名,国产亚洲av片在线观看18女人,黑人巨茎大战俄罗斯美女,扒下她的小内裤打屁股

Reinforcement Learning_Code_Policy Gradient

Reinforcement Learning_Code_Policy Gradient的評論 (共條)

你可能也喜歡這些文章

最新發(fā)布的文章

五月天青色头像情侣网名,国产亚洲av片在线观看18女人,黑人巨茎大战俄罗斯美女,扒下她的小内裤打屁股

Reinforcement Learning_Code_Policy Gradient

本文作者的其他文章

Reinforcement Learning_Code_Policy Gradient的評論 (共 條)

你可能也喜歡這些文章

最新發(fā)布的文章

Reinforcement Learning_Code_Policy Gradient的評論 (共條)