Reinforcement Learning

From Humanoid Robots Wiki
Revision as of 06:21, 16 May 2024 by Vrtnis (talk | contribs) (Training algorithms)

Training algorithms

  • A2C (Advantage Actor-Critic; also see the slides on actor-critic methods at [1])
  • PPO (Proximal Policy Optimization)
  • SAC (Soft Actor-Critic)
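
As a rough illustration of one of the algorithms listed above, the sketch below computes PPO's clipped surrogate objective for a batch of samples. It is a minimal pure-Python sketch, not a training loop: the function name ppo_clipped_objective, its parameter names, and the default clip range of 0.2 are assumptions chosen for this example and do not come from any particular library.

```python
from typing import Sequence


def ppo_clipped_objective(
    ratios: Sequence[float],
    advantages: Sequence[float],
    clip_eps: float = 0.2,  # assumed default; tune per task
) -> float:
    """Mean clipped surrogate objective over a batch (to be maximized).

    ratios[i] is pi_new(a_i | s_i) / pi_old(a_i | s_i) and advantages[i]
    is the advantage estimate for the same sample.
    """
    if len(ratios) != len(advantages):
        raise ValueError("ratios and advantages must have the same length")
    if not ratios:
        raise ValueError("batch must not be empty")

    total = 0.0
    for ratio, advantage in zip(ratios, advantages):
        # Keep the probability ratio inside [1 - clip_eps, 1 + clip_eps].
        clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * advantage, clipped_ratio * advantage)
    return total / len(ratios)


if __name__ == "__main__":
    # Hypothetical batch of two samples, for illustration only.
    print(ppo_clipped_objective([1.5, 0.5], [1.0, -1.0]))
```

Taking the minimum of the two terms removes the incentive to move the new policy far from the old one within a single update, which is the main idea that distinguishes PPO from a plain policy-gradient step.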

References

Resources