Finetune

This is a modified version of the prepare_dataset and train scripts from https://github.com/pacman100/DHS-LLM-Workshop, used to fine-tune an LLM.

prepare_dataset.py creates a dataset and uploads it to Hugging Face. It is usually run on a local machine.
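As a rough illustration of the "create a dataset, then upload it" flow, here is a minimal sketch. It is not the code in prepare_dataset.py: the function names `collect_records` and `upload`, the `path`/`content` record layout, and the choice to gather `.py` files from a local directory are all assumptions made for the example. The upload step assumes the `datasets` package is installed and that you are already logged in to Hugging Face.

```python
from pathlib import Path


def collect_records(root, extensions=(".py",)):
    """Return one {"path", "content"} record per matching file under root.

    Hypothetical helper: the record layout is an assumption, not the
    format used by prepare_dataset.py.
    """
    root = Path(root)
    records = []
    # Sort so the resulting dataset has a stable, reproducible order.
    for path in sorted(root.rglob("*")):
        if path.is_file() and path.suffix in extensions:
            records.append({
                "path": path.relative_to(root).as_posix(),
                "content": path.read_text(encoding="utf-8", errors="ignore"),
            })
    return records


def upload(records, repo_id):
    """Push the records to the Hugging Face Hub as a dataset.

    Hypothetical helper: requires the `datasets` package and a prior
    Hugging Face login; repo_id is a placeholder such as "user/name".
    """
    from datasets import Dataset  # imported lazily; only needed for the upload

    Dataset.from_list(records).push_to_hub(repo_id)
```

On a local machine this could be used by calling `collect_records` on a source directory and passing the result to `upload` together with a dataset repository ID of your own.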

train.py trains a given model on a given dataset. It is used in the Colab notebooks.

requirements.txt lists the packages needed to train a model. It is used in the Colab notebooks.
