SentiSynth explores how synthetic data can improve sentiment analysis models when labeled data is scarce.
# Clone the repository
git clone https://github.com/yourusername/sentisynth.git
cd sentisynth
# Create a virtual environment
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
# Install dependencies
pip install -e .
pip install -r requirements-dev.txtMore details coming soon!
sentisynth/: Main packagedata/: Data loading and processingmodels/: Model implementationsgeneration/: Synthetic data generationevaluation/: Evaluation metrics and analysis
tests/: Unit testsnotebooks/: Jupyter notebooks for explorationscripts/: Utility scripts
To run on weftdrive:
nohup /srv/gpurun.pl python src/senti_synth/cli/01_train_teacher.py configs/teacher/stt2_hf.yaml > ~/scratch/senti_synth/logs/$(date +%Y%m%d_%H%M).log 2>&1 &- SSH into weftdrive:
ssh paramkapur@weftdrive.private.reed.edu - Git clone the repository:
git clone https://github.com/paramkpr/senti_synth.git - Setup the conda environment
/srv/conda/bin/conda initandsource ~/.bashrc - Enter the conda environment
conda activate deep-learning- Check what packages are installed
conda list - Install the packages for the project
pip install -r requirements.txt - Install the project
pip install -e .
- Check what packages are installed
- SCP
data/cleantoweftdrive:~/scratch/data/clean:scp -r data/clean paramkapur@weftdrive.private.reed.edu:~/scratch/paramkapur/data/clean- Ensure that the config file points to the correct path:
dataset_path: "~/scratch/data/clean"
- Ensure that the config file points to the correct path:
- Setup W&B:
export WANDB_API_KEY="..."python -m wandb login
- Create the logs directory and file:
mkdir -p ~/scratch/paramkapur/logsandtouch ~/scratch/paramkapur/logs/$(date +%Y%m%d_%H%M).log - Run the training script:
nohup /srv/gpurun.pl python src/cli/01_train_teacher.py configs/teacher/sst2_hf.yaml > ~/scratch/paramkapur/logs/$(date +%Y%m%d_%H%M).log 2>&1 &
/scratch/paramkapur/data/clean/clean
nohup /srv/gpurun.pl python src/cli/01_train_teacher.py configs/teacher/sst2_hf.yaml > ~/scratch/paramkapur/logs/$(date +%Y%m%d_%H%M).log 2>&1 &