Objective: Build a working tic-tac-toe program which can later be optimized with reinforcement learning techniques.
main.pyruns the entire program.state.pyrelies solely on prewritten .txt files. The rest rely on .txt files generated by other .py files.- Before running
main.py,calc-reward-output.txtshould always be in this format (whereX= # of player wins):
The player has won:
X
times
- Program runs successfully with current input files
- Missing detection for win/lose conditions
- Missing algorithm for reinforcement learning (will presumably be written in
main.py)