Open main menu

Humanoid Robots Wiki β

Changes

Allen's REINFORCE notes

70 bytes added, 00:59, 25 May 2024
Loss Function
=== Loss Function ===
 
The goal of REINFORCE is to optimize the expected cumulative reward.
53
edits