DAY 59-100 DAYS MLCODE: RL – Policy Gradient
In the previous blog, we discussed the Neural Network based policy, in this blog we are going to discuss the RL Policy Gradient. When we are playing a game like Frozen Lake in the previous blog, we may reach the goal but before reaching the goal, there may be various steps involved which result in…
Read more