Reinforcement Learning

From Humanoid Robots Wiki
Revision as of 06:05, 16 May 2024 by Vrtnis (talk | contribs) (Training algorithms)
Jump to: navigation, search

Training algorithms

  • A2C (also see slides on Actor Critic methods at [1] Stanford CS224R)
  • PPO
  • SAC

Resources