Reinforcement Learning
From Humanoid Robots Wiki
Revision as of 06:21, 16 May 2024 by Vrtnis
Training algorithms
A2C (Advantage Actor-Critic; also see the slides on actor-critic methods at [1])
PPO (Proximal Policy Optimization)
SAC (Soft Actor-Critic)
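
As a rough illustration of two of the ideas behind the algorithms listed above, the following sketch computes a simple advantage estimate of the kind actor-critic methods such as A2C rely on, and the per-sample clipped surrogate objective that PPO maximizes. It is a minimal, dependency-free sketch and not taken from this wiki or from any particular library: the function names, the Monte Carlo return (practical A2C implementations usually bootstrap from the critic over n steps), and the default values gamma=0.99 and clip_eps=0.2 are all assumptions chosen for illustration.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Return the discounted return G_t for every step of one episode."""
    returns = []
    running = 0.0
    # Walk backwards so each G_t reuses G_{t+1}: G_t = r_t + gamma * G_{t+1}.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def advantages(rewards, values, gamma=0.99):
    """Monte Carlo advantage estimate A_t = G_t - V(s_t).

    `values` are hypothetical critic outputs V(s_t), one per step.
    """
    return [g - v for g, v in zip(discounted_returns(rewards, gamma), values)]


def ppo_clipped_objective(logp_new, logp_old, advantage, clip_eps=0.2):
    """Per-sample PPO clipped surrogate objective (a quantity to maximize)."""
    # Probability ratio between the updated policy and the data-collecting policy.
    ratio = math.exp(logp_new - logp_old)
    # Restrict the ratio to [1 - clip_eps, 1 + clip_eps].
    clipped_ratio = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
    # Taking the minimum removes any incentive to move the ratio outside the clip range.
    return min(ratio * advantage, clipped_ratio * advantage)
```

Under these assumptions, advantages(rewards, values) gives the signal that scales the policy-gradient step in an actor-critic update, and averaging ppo_clipped_objective over a batch gives the PPO surrogate. SAC differs from both in that it is off-policy and adds an entropy term to the objective, so it is not covered by this sketch.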
References
[1] Stanford CS224R
Resources
Mandy Zhao's Reinforcement Learning Notes
Category: Software