Open main menu

Humanoid Robots Wiki β

Changes

Reinforcement Learning

55 bytes added, 24 April
A2C
==Training algorithms==
===[https://en.wikipedia.org/wiki/Advantage_Actor_Critic A2C]===
===[https://en.wikipedia.org/wiki/Proximal_policy_optimization PPO]===
Anonymous user