Search results

  • ...iate objective with respect to the optimal theta and then perform gradient descent ...e value function or Q-function of the current policy, and find a better policy gradient
    5 KB (891 words) - 23:55, 24 May 2024
  • Based on the loss, use a gradient descent policy to update weights Now we want to find the gradient of <math> J (\theta) </math>, namely
    5 KB (852 words) - 01:23, 26 May 2024
  • Notes on various riffs on Gradient Descent from the perspective of neural networks.
    9 KB (1,469 words) - 04:57, 25 May 2024