Revision as of 20:24, 24 May 2024
Allen's REINFORCE notes
Links
* [http://www.incompleteideas.net/book/RLbook2020.pdf RLbook2020]
Motivation
Learning
Learning involves the agent taking actions and the environment returning a new state and reward.
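- Input: <math>s_t</math>: states at each time step
- Output: <math>a_t</math>: actions at each time step
- Data: trajectories <math>(s_1, a_1, r_1, \ldots, s_T, a_T, r_T)</math> of states, actions, and rewards
- Learn a policy that maps states to actions so as to maximize the total reward <math>\sum_{t=1}^{T} r_t</math>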
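
The interaction loop described above can be sketched in a few lines of Python. This is only an illustrative sketch, not part of the original notes: `CorridorEnv`, `random_policy`, `collect_episode`, and `total_reward` are hypothetical names, and the reward values are arbitrary assumptions. It shows how one episode yields the data (state, action, reward triples) and the quantity the agent is trying to maximize.

```python
import random


class CorridorEnv:
    """Hypothetical toy environment: the agent walks along a corridor to a goal."""

    def __init__(self, length=5):
        self.length = length
        self.state = 0

    def reset(self):
        # Start every episode at the left end of the corridor.
        self.state = 0
        return self.state

    def step(self, action):
        # action is -1 (left) or +1 (right); the position is clipped at 0.
        self.state = max(0, self.state + action)
        done = self.state >= self.length
        # Assumed reward scheme: small step penalty, bonus on reaching the goal.
        reward = 1.0 if done else -0.1
        return self.state, reward, done


def random_policy(state, rng):
    # Placeholder policy: ignores the state and picks a direction uniformly.
    return rng.choice([-1, 1])


def collect_episode(env, policy, rng, max_steps=100):
    """Run one episode and return the list of (state, action, reward) triples."""
    trajectory = []
    state = env.reset()
    for _ in range(max_steps):
        action = policy(state, rng)
        next_state, reward, done = env.step(action)
        trajectory.append((state, action, reward))
        state = next_state
        if done:
            break
    return trajectory


def total_reward(trajectory):
    # The learning objective: the sum of rewards over the episode.
    return sum(reward for _, _, reward in trajectory)


rng = random.Random(0)
episode = collect_episode(CorridorEnv(), random_policy, rng)
print(len(episode), total_reward(episode))
```

A learning algorithm such as REINFORCE would replace `random_policy` with a parameterized policy and adjust its parameters so that episodes with a higher `total_reward` become more likely.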