Humanoid Robots Wiki
Allen's REINFORCE notes
70 bytes added, 00:59, 25 May 2024 (→ Loss Function)
=== Loss Function ===
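The goal of REINFORCE is to maximize the expected cumulative reward (the return) that the policy collects over an episode. Since optimizers typically minimize, the loss is taken as the negative of this objective.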
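As a rough illustration of that objective, here is a minimal sketch of how a REINFORCE-style surrogate loss might be computed for a single episode. The function names (<code>returns_to_go</code>, <code>reinforce_loss</code>) and the discount factor <code>gamma</code> are assumptions made for this example and do not come from the notes. With <code>gamma=1.0</code> the return is the plain cumulative reward. The sketch uses plain Python floats, so it only shows the arithmetic; in practice the log-probabilities would come from an autodiff framework so the loss can be differentiated with respect to the policy parameters.

```python
def returns_to_go(rewards, gamma=1.0):
    """Cumulative (optionally discounted) reward from each step to the episode end.

    gamma is an assumed parameter; gamma=1.0 gives the undiscounted sum.
    """
    out = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        out[t] = running
    return out


def reinforce_loss(log_probs, rewards, gamma=1.0):
    """Negative of sum_t log pi(a_t | s_t) * G_t for one episode.

    Minimizing this value corresponds to maximizing the expected return.
    log_probs[t] is the log-probability the policy assigned to the action
    actually taken at step t.
    """
    if len(log_probs) != len(rewards):
        raise ValueError("log_probs and rewards must have the same length")
    returns = returns_to_go(rewards, gamma)
    return -sum(lp * g for lp, g in zip(log_probs, returns))


# Hypothetical three-step episode.
episode_rewards = [1.0, 2.0, 3.0]
episode_log_probs = [-0.5, -1.0, -2.0]
print(returns_to_go(episode_rewards))                      # [6.0, 5.0, 3.0]
print(reinforce_loss(episode_log_probs, episode_rewards))  # 14.0
```

Each action's log-probability is weighted by the reward collected from that step onward, so actions followed by high returns are made more likely when the loss is minimized.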
Allen12 (53 edits)