DAY 59-100 DAYS MLCODE: RL – Policy Gradient

In the previous blog, we discussed the neural-network-based policy; in this blog, we are going to discuss the RL policy gradient. When we play a game like Frozen Lake, as in the previous blog, we may reach the goal, but before reaching it there may be various steps involved which result in…

January 8, 2019