Humanoid Robots Wiki β

Allen's REINFORCE notes

70 bytes added, 25 May
=== Loss Function ===
 
The goal of REINFORCE is to maximize the expected cumulative reward (the return) obtained by following the policy.
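
As a rough illustration of how that objective is usually turned into a loss, the sketch below computes the surrogate loss for one episode: the negative sum of each action's log-probability weighted by the return that followed it. The function names `discounted_returns` and `reinforce_loss`, and the discount factor `gamma`, are assumptions made for this example and do not come from the notes. In practice the log-probabilities would come from a differentiable policy so that an autodiff framework can take the gradient; plain floats are used here only to show the arithmetic.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * r_{t+1} + ... for every step t.

    `gamma` is an assumed discount factor; gamma=1.0 gives the plain
    cumulative reward.
    """
    returns = []
    running = 0.0
    # Walk backwards so each return reuses the one that follows it.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def reinforce_loss(log_probs, rewards, gamma=0.99):
    """Surrogate loss for one episode (hypothetical helper).

    Minimizing this value with respect to the policy parameters
    corresponds to ascending the REINFORCE gradient estimate of the
    expected return.
    """
    returns = discounted_returns(rewards, gamma)
    return -sum(lp * g for lp, g in zip(log_probs, returns))


if __name__ == "__main__":
    # Three steps, each action taken with probability 0.5, reward 1 per step.
    log_probs = [math.log(0.5)] * 3
    rewards = [1.0, 1.0, 1.0]
    print(discounted_returns(rewards, gamma=0.5))
    print(reinforce_loss(log_probs, rewards, gamma=0.5))
```

The sign flip is there because most optimizers minimize, while REINFORCE aims to maximize the return.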