Changes

Jump to: navigation, search

Reinforcement Learning

55 bytes added, 24 April
A2C
==Training algorithms==
===[https://en.wikipedia.org/wiki/Advantage_Actor_Critic A2C]===
===[https://en.wikipedia.org/wiki/Proximal_policy_optimization PPO]===
Anonymous user

Navigation menu