Deep RL Networks

In order to understand PyTorch and DeepRL on a deeper level, I decided to implement some algorithms from scratch using PyTorch. These are my implementations of the common reinforcement learning algorithms DQN, A2C, and PPO using Pytorch and Gymnasium. Feel free to use the code as a reference. My skill in pytorch increased by a lot with every algorithm, so the older the file, the more likely you are to find bad practices.

Check out this presentation I made!

I made a presentation about the material to help me digest it. If you find any problems please reach out! Intro to DeepRL

Resources

DQN Paper
DQN Paper 2
DQN Guide
A2C/A3C Paper
Hugging Face A2C
PPO Paper
Hugging Face PPO
PPO Implementation Details
GAE Value Function Estimation (PPO)

Recommended Background

Sutton and Barto: RL Introduction. Chapters 1-7
OpenAI: Spinning Up as a Deep RL Researcher. "The Right Background"

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.vscode		.vscode
A2C		A2C
DQN		DQN
PPO		PPO
.python-version		.python-version
README.md		README.md
main.py		main.py
policy.pth		policy.pth
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep RL Networks

Check out this presentation I made!

Resources

Recommended Background

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Deep RL Networks

Check out this presentation I made!

Resources

Recommended Background

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages