Changes
Jump to:
navigation
,
search
← Older edit
Newer edit →
Reinforcement Learning
55 bytes added
,
24 April
→
A2C
==Training algorithms==
===
[https://en.wikipedia.org/wiki/Advantage_Actor_Critic
A2C
]
===
===[https://en.wikipedia.org/wiki/Proximal_policy_optimization PPO]===
Anonymous user
104.7.66.79
Navigation menu
Personal tools
Not logged in
Talk
Contributions
Create account
Log in
Namespaces
Page
Discussion
Variants
Views
Read
Edit
View history
More
Search
Navigation
Main page
Recent changes
Random page
Help
Tools
Special pages