Difference between revisions of "Reinforcement Learning"

From Humanoid Robots Wiki
Jump to: navigation, search
(A2C)
(Training algorithms)
Line 7: Line 7:
  
 
===[https://en.wikipedia.org/wiki/Proximal_policy_optimization PPO]===
 
===[https://en.wikipedia.org/wiki/Proximal_policy_optimization PPO]===
 +
 +
===[https://spinningup.openai.com/en/latest/algorithms/sac.html SAC]===

Revision as of 18:56, 25 April 2024


Training algorithms

A2C

PPO

SAC