A framework for training large language models with custom loss functions and model architectures. This project provides a flexible and extensible training pipeline that supports various model types, datasets, and training configurations.
- Clone the repository:

```bash
git clone https://github.com/efrick2002/highly-rewarding.git
cd highly-rewarding
```

- Run the setup script:

```bash
source setup.sh
```

Note: the setup script installs `uv`. The environment is activated with `source .venv/bin/activate`.
- `train.py`: Main training script
- `modeling.py`: Model architecture definitions and registry
- `losses.py`: Custom loss function implementations
- `dataset.py`: Dataset handling and data loading
- `utils.py`: Utility functions
- `model_type_registry.py`: Model type registration system
- Log in to Weights & Biases:

```bash
wandb login
```

- Configure your training parameters in a YAML config file
- Run the training script:

```bash
deepspeed --num_gpus=8 train.py -c configs/bt_debug.yaml
```

The training configuration should be specified in a YAML file. See `configs/` for examples.
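As a rough illustration only, a training config of this kind might look like the sketch below. Every key name here is a guess at a plausible schema, not the project's actual one; consult the files in `configs/` (e.g. `configs/bt_debug.yaml`) for the real set of supported keys.

```yaml
# Hypothetical config sketch -- key names are illustrative assumptions,
# not this project's actual schema. See configs/ for real examples.
model_name_or_path: path/to/base-model   # base model to fine-tune (assumed key)
model_type: bt                           # entry in the model type registry (assumed key)
loss: bt                                 # custom loss from losses.py (assumed key)
dataset_path: path/to/data.jsonl         # training data (assumed key)
learning_rate: 1e-5
num_train_epochs: 1
per_device_train_batch_size: 4
output_dir: checkpoints/bt_debug
```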
Contributions are welcome! Please feel free to submit a Pull Request.