- Atari (KungFu-v0): Deep Q-Learning model
- Smart-Taxi: Q-Learning
- SonicTheHedgehog: PPO (Proximal Policy Optimization) algorithm with the Actor Advantage Critic (A2C) model
This repository contains three main projects that focus on implementing various RL algorithms for different environments and games (TensorFlow, OpenAI Gym).
- Atari (KungFu-v0)
The Atari section of the repository focuses on implementing the Deep Q-Network (DQN) algorithm for the "KungFu-v0" Atari game. The main code file for this section is DQNAtariKungFuv0.py. The steps involved in this section are as follows: - Smart-Taxi
The Smart-Taxi section implements the Q-Learning algorithm for a Smart-Taxi environment. The main code file for this section is SmartTaxi.py. The steps involved in this section are as follows:- Import the necessary libraries and create an instance of the Smart-Taxi environment.
- Initialize the Q-table with zeros to store the Q-values.
- Define the epsilon-greedy policy function for action selection.
- Implement the main training loop: interact with the environment, update Q-values based on the Q-Learning algorithm, and repeat for a specified number of episodes.
- Include a testing loop to navigate the Smart-Taxi environment using the learned Q-values.
- Evaluate the agent's performance over a specified number of test episodes and print the average score.

- Sonic the Hedgehog-PPO
The Sonic the Hedgehog-PPO section focuses on training an agent to play the game "Sonic the Hedgehog" using the Proximal Policy Optimization (PPO) algorithm. The main code file for this section is run.py. The steps involved in this section are as follows:
Please refer to the respective sections for more details on each algorithm's implementation and usage.

