Open main menu
Humanoid Robots Wiki
β
Search
Changes
← Older edit
Newer edit →
Reinforcement Learning
55 bytes added
,
24 April
→
A2C
==Training algorithms==
===
[https://en.wikipedia.org/wiki/Advantage_Actor_Critic
A2C
]
===
===[https://en.wikipedia.org/wiki/Proximal_policy_optimization PPO]===
Anonymous user
104.7.66.79