== Training algorithms ==
* [https://en.wikipedia.org/wiki/Advantage_Actor_Critic A2C]
* [https://en.wikipedia.org/wiki/Proximal_policy_optimization PPO]
* [https://spinningup.openai.com/en/latest/algorithms/sac.html SAC]
== Resources ==
* [https://mandi-zhao.gitbook.io/deeprl-notes Mandy Zhao's Reinforcement Learning Notes]