Allen's REINFORCE notes
Links
Motivation
Learning
Learning involves the agent taking actions and the environment returning a new state and reward.
- Input: $s_t$: States at each time step
- Output: $a_t$: Actions at each time step
- Data:
- Learn to maximize the expected cumulative reward
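
The interaction loop described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `ToyEnv`, `random_policy`, and `run_episode` are hypothetical names made up for this sketch, the corridor environment is a toy, and the policy is uniformly random rather than learned.

```python
import random


class ToyEnv:
    """Hypothetical 1-D corridor: states 0..length-1, start at 0, goal at the right end."""

    def __init__(self, length=5):
        self.length = length
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # action 1 moves right, anything else moves left; walls clamp the position
        move = 1 if action == 1 else -1
        self.state = min(max(self.state + move, 0), self.length - 1)
        done = self.state == self.length - 1
        reward = 1.0 if done else 0.0  # reward only on reaching the goal
        return self.state, reward, done


def random_policy(state, rng):
    # Placeholder policy (an assumption): ignores the state and picks uniformly
    return rng.choice([0, 1])


def run_episode(env, policy, rng, max_steps=100):
    """Roll out one episode and record the state, action, and reward at each step."""
    states, actions, rewards = [], [], []
    state = env.reset()
    for _ in range(max_steps):
        action = policy(state, rng)                  # agent acts on the current state
        next_state, reward, done = env.step(action)  # environment responds
        states.append(state)
        actions.append(action)
        rewards.append(reward)
        state = next_state
        if done:
            break
    return states, actions, rewards


rng = random.Random(0)
states, actions, rewards = run_episode(ToyEnv(), random_policy, rng)
total_reward = sum(rewards)
```

The recorded states, actions, and rewards are the raw material a learning rule would work from, and `total_reward` is the per-episode quantity an agent would try to make large.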