Humanoid Robots Wiki
Allen's REINFORCE notes
70 bytes added, 25 May (→ Loss Function)
=== Loss Function ===
The goal of REINFORCE is to maximize the expected cumulative reward (the return) that the policy collects over an episode.
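The note does not yet spell out a loss, so the following is only a minimal sketch of one common way the objective is turned into something a minimizer can use: the negated sum of each action's log-probability weighted by the discounted return from that step onward. The function names, the discount factor `gamma`, and the use of plain Python lists are assumptions made for illustration, not part of the original note.

```python
from typing import List


def discounted_returns(rewards: List[float], gamma: float = 0.99) -> List[float]:
    """Return G_t = r_t + gamma * r_{t+1} + gamma^2 * r_{t+2} + ... for every step t."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk the episode backwards so each return reuses the one after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def reinforce_loss(log_probs: List[float], rewards: List[float], gamma: float = 0.99) -> float:
    """Surrogate loss for one episode: -sum_t log pi(a_t | s_t) * G_t.

    Minimizing this value corresponds to increasing the expected return.
    log_probs[t] is assumed to be the log-probability the policy assigned
    to the action actually taken at step t (hypothetical input format).
    """
    if len(log_probs) != len(rewards):
        raise ValueError("log_probs and rewards must have the same length")
    returns = discounted_returns(rewards, gamma)
    return -sum(lp * g for lp, g in zip(log_probs, returns))
```

In practice the log-probabilities would come from a differentiable policy network so that an automatic-differentiation framework can compute the gradient of this loss; the plain floats here only show the arithmetic.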
Allen12 (53 edits)