Open main menu

Humanoid Robots Wiki β

Changes

Allen's REINFORCE notes

1 byte added, 25 May
Overview
‎<syntaxhighlight lang="bash" line>
Initialize neural network with input dimensions = observation dimensions and output dimensions = action dimensions
For \# of episodes:
While not terminated:
Get observation from environment
53
edits