Difference between revisions of "Reinforcement Learning"

From Humanoid Robots Wiki
Jump to: navigation, search
(Training algorithms)
Line 1: Line 1:
 +
== Training algorithms ==
  
 +
* [https://en.wikipedia.org/wiki/Advantage_Actor_Critic A2C]
 +
* [https://en.wikipedia.org/wiki/Proximal_policy_optimization PPO]
 +
* [https://spinningup.openai.com/en/latest/algorithms/sac.html SAC]
  
 +
== Resources ==
  
==Training algorithms==
+
* [https://mandi-zhao.gitbook.io/deeprl-notes Mandy Zhao's Reinforcement Learning Notes]
 
 
===[https://en.wikipedia.org/wiki/Advantage_Actor_Critic A2C]===
 
 
 
===[https://en.wikipedia.org/wiki/Proximal_policy_optimization PPO]===
 
 
 
===[https://spinningup.openai.com/en/latest/algorithms/sac.html SAC]===
 

Revision as of 01:52, 29 April 2024

Training algorithms

Resources