53
edits
Changes
→Overview
<syntaxhighlight lang="bash" line>
Initialize neural network with input dimensions = observation dimensions and output dimensions = action dimensions
For \# of episodes:
While not terminated:
Get observation from environment