Open main menu
Humanoid Robots Wiki
β
Search
Edit
Reinforcement Learning
Revision as of 06:21, 16 May 2024 by
Vrtnis
(
talk
|
contribs
)
(
→
Training algorithms
)
(
diff
)
← Older revision
|
Latest revision
(
diff
) |
Newer revision →
(
diff
)
Training algorithms
A2C
(also see slides on Actor Critic methods at [1])
PPO
SAC
References
[1]
Stanford CS224R
Resources
Mandy Zhao's Reinforcement Learning Notes