Reinforcement Learning (RL) and Deep RL Projects

Atari (KungFu-v0): Deep Q-Learning model
Smart-Taxi: Q-Learning
SonicTheHedgehog: PPO (Proximal Policy Optimization) algorithm with the Advantage Actor-Critic (A2C) model

This repository contains three projects, each implementing a reinforcement learning algorithm for a different environment or game. The projects are built with TensorFlow and OpenAI Gym.

Atari (KungFu-v0)
The Atari section of the repository focuses on implementing the Deep Q-Network (DQN) algorithm for the "KungFu-v0" Atari game. The main code file for this section is DQNAtariKungFuv0.py. The steps involved in this section are as follows:
- Pre-process the game frames.
- Define the network architecture.
- Train the agent using the DQN algorithm.
- Implement utility functions for memory replay, epsilon-greedy action selection, and frame stacking.
- Save the trained model in the saved_models directory.
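The utility functions listed above (memory replay, epsilon-greedy action selection, and frame stacking) can be sketched roughly as follows. This is a minimal, standard-library-only illustration and not the code in DQNAtariKungFuv0.py; the names `ReplayMemory`, `epsilon_greedy`, and `FrameStack` are hypothetical, and frames are treated as opaque objects so that any pre-processing can be plugged in.

```python
import random
from collections import deque


class ReplayMemory:
    """Fixed-size buffer of (state, action, reward, next_state, done) tuples."""

    def __init__(self, capacity):
        # A bounded deque drops the oldest transition once capacity is reached.
        self.buffer = deque(maxlen=capacity)

    def add(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement reduces temporal correlation.
        return random.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


class FrameStack:
    """Keeps the last `size` pre-processed frames as one stacked state."""

    def __init__(self, size):
        self.frames = deque(maxlen=size)

    def reset(self, frame):
        # At episode start, fill the stack with copies of the first frame.
        self.frames.clear()
        for _ in range(self.frames.maxlen):
            self.frames.append(frame)
        return tuple(self.frames)

    def push(self, frame):
        # Appending to a full deque discards the oldest frame automatically.
        self.frames.append(frame)
        return tuple(self.frames)
```

In a training loop of this shape, `epsilon` would typically start near 1.0 and decay over time, and the stacked state returned by `FrameStack` would be the network input. Both choices are assumptions here, not details taken from the repository.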
Smart-Taxi
The Smart-Taxi section implements the Q-Learning algorithm for a Smart-Taxi environment. The main code file for this section is SmartTaxi.py. The steps involved in this section are as follows:
- Import the necessary libraries and create an instance of the Smart-Taxi environment.
- Initialize the Q-table with zeros to store the Q-values.
- Define the epsilon-greedy policy function for action selection.
- Implement the main training loop: interact with the environment, update Q-values based on the Q-Learning algorithm, and repeat for a specified number of episodes.
- Include a testing loop to navigate the Smart-Taxi environment using the learned Q-values.
- Evaluate the agent's performance over a specified number of test episodes and print the average score.
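The steps above can be sketched as a short tabular Q-Learning loop. To keep the example self-contained, it uses a hypothetical one-dimensional `CorridorEnv` as a stand-in for the Smart-Taxi environment; the hyperparameters (`alpha`, `gamma`, `epsilon`, episode counts) are illustrative assumptions and are not taken from SmartTaxi.py.

```python
import random


class CorridorEnv:
    """Hypothetical stand-in environment: walk from cell 0 to the last cell."""

    def __init__(self, length=5):
        self.length = length
        self.n_states = length
        self.n_actions = 2  # 0 = left, 1 = right
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        move = 1 if action == 1 else -1
        self.state = min(max(self.state + move, 0), self.length - 1)
        done = self.state == self.length - 1
        reward = 10.0 if done else -1.0  # step penalty, bonus at the goal
        return self.state, reward, done


def train(env, episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1,
          max_steps=100, seed=0):
    rng = random.Random(seed)
    # Q-table initialized with zeros: one row per state, one column per action.
    q_table = [[0.0] * env.n_actions for _ in range(env.n_states)]
    for _ in range(episodes):
        state = env.reset()
        for _ in range(max_steps):
            # Epsilon-greedy action selection, breaking ties at random.
            if rng.random() < epsilon:
                action = rng.randrange(env.n_actions)
            else:
                best = max(q_table[state])
                action = rng.choice(
                    [a for a, q in enumerate(q_table[state]) if q == best])
            next_state, reward, done = env.step(action)
            # Q-Learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            future = 0.0 if done else gamma * max(q_table[next_state])
            q_table[state][action] += alpha * (reward + future - q_table[state][action])
            state = next_state
            if done:
                break
    return q_table


def evaluate(env, q_table, episodes=10, max_steps=100):
    """Run the greedy policy and return the average episode score."""
    total = 0.0
    for _ in range(episodes):
        state = env.reset()
        for _ in range(max_steps):
            action = max(range(env.n_actions), key=lambda a: q_table[state][a])
            state, reward, done = env.step(action)
            total += reward
            if done:
                break
    return total / episodes


if __name__ == "__main__":
    env = CorridorEnv()
    q_table = train(env)
    print("Average test score:", evaluate(env, q_table))
```

Swapping `CorridorEnv` for a Gym environment would mainly change how `reset` and `step` results are unpacked, which depends on the installed Gym version.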
Sonic the Hedgehog-PPO
The Sonic the Hedgehog-PPO section focuses on training an agent to play the game "Sonic the Hedgehog" using the Proximal Policy Optimization (PPO) algorithm. The main code file for this section is run.py. The steps involved in this section are as follows:
- Implement the Advantage Actor-Critic (A2C) architecture.
- Train the agent using the PPO algorithm.
- Play the game in the OpenAI Gym environment.
- Provide utility functions for the project, such as loading model checkpoints and saving the trained model.
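As a rough illustration of how PPO and the actor-critic architecture fit together, the sketch below computes advantages from the critic's value estimates and then evaluates PPO's clipped surrogate objective for the actor. It is plain Python written for clarity and is not the code in run.py; the function names, the simple advantage estimate (return minus value), and the default `clip_range` of 0.2 are assumptions.

```python
import math


def discounted_returns(rewards, gamma):
    """Discounted return-to-go for each step of one finished episode."""
    returns, running = [], 0.0
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def advantages(returns, values):
    """Advantage estimate A_t = R_t - V(s_t), using the critic's predictions."""
    return [ret - val for ret, val in zip(returns, values)]


def ppo_clipped_objective(new_log_probs, old_log_probs, advs, clip_range=0.2):
    """Mean clipped surrogate objective; the actor is trained to maximize it."""
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advs):
        # Probability ratio between the updated policy and the old policy.
        ratio = math.exp(new_lp - old_lp)
        clipped = min(max(ratio, 1.0 - clip_range), 1.0 + clip_range)
        # Taking the minimum removes the incentive to move the ratio
        # outside the interval [1 - clip_range, 1 + clip_range].
        total += min(ratio * adv, clipped * adv)
    return total / len(advs)
```

In a TensorFlow implementation the same quantities would be computed on tensors, and the negated objective would usually be combined with a value loss and an entropy bonus. That combination is a common convention rather than something stated in this README.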

Please refer to the respective sections for more details on each algorithm's implementation and usage.
