Difference between revisions of "Reinforcement Learning"

From Humanoid Robots Wiki
(Created page with " ==Training algorithms== ===A2C=== ===PPO===")
 
(PPO)
Line 6:
 ===A2C===
- ===PPO===
+ ===[https://en.wikipedia.org/wiki/Proximal_policy_optimization PPO]===

Revision as of 22:36, 24 April 2024


Training algorithms

A2C

PPO