Difference between revisions of "Reinforcement Learning"

Revision as of 22:36, 24 April 2024 (edit) 104.7.66.79 (talk) (→‎A2C) ← Older edit		Revision as of 18:56, 25 April 2024 (edit) (undo) 2.127.48.230 (talk) (→‎Training algorithms) Newer edit →
Line 7:		Line 7:

	===[https://en.wikipedia.org/wiki/Proximal_policy_optimization PPO]===		===[https://en.wikipedia.org/wiki/Proximal_policy_optimization PPO]===
		+
		+	===[https://spinningup.openai.com/en/latest/algorithms/sac.html SAC]===

Revision as of 18:56, 25 April 2024