snnclsr/rl

In this repository, I share my knowledge and work on reinforcement learning. I write blog posts and implement the algorithms in order to understand them well. I also share articles that help explain the concepts.

Algorithms

Vanilla Policy Gradients (REINFORCE)

Check out the blog post for a detailed explanation.

Summary: Policy gradient algorithms learn (optimize) the policy directly. We generate sample trajectories from the environment by running the current policy. For each trajectory, we sum the gradients of the log-probabilities of the actions taken and compute the total reward. We multiply the two, average over the trajectories, and update the policy parameters with gradient ascent.
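
The summary above can be sketched in a few lines of NumPy. This is a minimal illustration, not the implementation in this repository: it assumes a tabular softmax policy and a made-up toy environment in which the state is the time step and action 1 yields a reward of 1. The names `softmax`, `sample_trajectory`, `reinforce_gradient` and `train` are hypothetical and chosen only for this example.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over the last axis."""
    z = logits - np.max(logits, axis=-1, keepdims=True)
    e = np.exp(z)
    return e / np.sum(e, axis=-1, keepdims=True)


def sample_trajectory(theta, rng, horizon=5):
    """Roll out the current policy in a toy environment (an assumption).

    The state is the time step; action 1 gives reward 1, action 0 gives 0.
    """
    states, actions, rewards = [], [], []
    for t in range(horizon):
        probs = softmax(theta[t])
        a = rng.choice(len(probs), p=probs)
        states.append(t)
        actions.append(a)
        rewards.append(float(a == 1))
    return states, actions, rewards


def reinforce_gradient(theta, trajectories):
    """Average over trajectories of (sum of grad log pi) * (total reward)."""
    grad = np.zeros_like(theta)
    for states, actions, rewards in trajectories:
        total_reward = sum(rewards)
        grad_log_prob = np.zeros_like(theta)
        for s, a in zip(states, actions):
            # For a softmax policy:
            # d log pi(a|s) / d theta[s] = onehot(a) - pi(.|s)
            g = -softmax(theta[s])
            g[a] += 1.0
            grad_log_prob[s] += g
        grad += grad_log_prob * total_reward
    return grad / len(trajectories)


def train(iterations=200, batch_size=16, lr=0.1, horizon=5, seed=0):
    """Run REINFORCE with plain gradient ascent on the toy environment."""
    rng = np.random.default_rng(seed)
    theta = np.zeros((horizon, 2))  # one row of action logits per time step
    for _ in range(iterations):
        batch = [sample_trajectory(theta, rng, horizon)
                 for _ in range(batch_size)]
        theta += lr * reinforce_gradient(theta, batch)  # ascent, not descent
    return theta
```

Calling `train()` returns the learned logits, and `softmax(theta)` gives the action probabilities at each time step, which should lean toward action 1 after training. The sketch uses no baseline or reward-to-go, so the gradient estimate is noisy; those refinements are left out to keep it close to the summary.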

Code

The skeleton code for the implementation is taken from Assignment 2 of the Berkeley RL course, which can be found here. Also check out the course content here.
