Open main menu

Humanoid Robots Wiki β

Reinforcement Learning

Revision as of 06:05, 16 May 2024 by Vrtnis (talk | contribs) (Training algorithms)

Training algorithms

  • A2C (also see slides on Actor Critic methods at [1] Stanford CS224R)
  • PPO
  • SAC

Resources