Allen's REINFORCE notes
70 bytes added, 25 May
=== Loss Function ===
The goal of REINFORCE is to maximize the expected cumulative reward (the return) obtained by following the policy.
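As a rough illustration of how that objective is usually turned into a loss, the sketch below computes discounted returns and the standard surrogate loss, the negated sum of log-probabilities weighted by returns. The function names, the discount factor `gamma`, and the use of plain Python floats are assumptions made for this example, not something these notes specify. In an autodiff framework, `log_probs` would be differentiable values produced by the policy.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Return G_t = sum over k >= t of gamma^(k-t) * r_k, for every step t."""
    returns = []
    running = 0.0
    # Walk the episode backwards so each return reuses the next one.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def reinforce_loss(log_probs, rewards, gamma=0.99):
    """Surrogate loss: -sum_t log pi(a_t | s_t) * G_t.

    Minimizing this loss corresponds to gradient ascent on the
    expected return (hypothetical helper, for illustration only).
    """
    returns = discounted_returns(rewards, gamma)
    return -sum(lp * g for lp, g in zip(log_probs, returns))


if __name__ == "__main__":
    # Toy episode: three steps, each action taken with probability 0.5.
    log_probs = [math.log(0.5)] * 3
    rewards = [1.0, 1.0, 1.0]
    print(discounted_returns(rewards, gamma=0.5))
    print(reinforce_loss(log_probs, rewards, gamma=0.5))
```

The sign flip is there because most optimizers minimize, while REINFORCE maximizes the expected return.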
Allen12 (53 edits)