Humanoid Robots Wiki
Reinforcement Learning
Revision as of 06:05, 16 May 2024 by Vrtnis (→ Training algorithms)
Training algorithms
A2C (also see the slides on actor-critic methods from Stanford CS224R [1])
PPO
SAC
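As a rough illustration of how two of the algorithms above differ, the following is a minimal sketch of the policy loss terms commonly associated with A2C and PPO. It is not taken from this wiki or from any particular library: the function names `a2c_policy_loss` and `ppo_clip_loss`, the `clip_eps` default of 0.2, and the plain-list inputs are all assumptions made for illustration. The value-function and entropy terms that real implementations usually add are left out, and SAC is not covered.

```python
import math


def a2c_policy_loss(log_probs, advantages):
    """A2C-style policy loss (hypothetical helper).

    Negative mean of log pi(a|s) * advantage, so minimizing it
    raises the probability of actions with positive advantage.
    """
    weighted = [lp * adv for lp, adv in zip(log_probs, advantages)]
    return -sum(weighted) / len(weighted)


def ppo_clip_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """PPO clipped surrogate loss (hypothetical helper).

    The probability ratio new/old is clipped to [1 - eps, 1 + eps],
    and the smaller of the clipped and unclipped terms is kept,
    which limits how far one update can move the policy.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        ratio = math.exp(new_lp - old_lp)
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        total += min(ratio * adv, clipped * adv)
    # Negated so the objective can be minimized like a loss.
    return -total / len(advantages)
```

In this sketch, when the new and old log-probabilities are equal the ratio is 1 and the PPO term reduces to the plain advantage, which is the sense in which PPO can be read as a constrained variant of the actor-critic update.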
Resources
Mandy Zhao's Reinforcement Learning Notes