This project is a complete Sentiment Analysis pipeline that fine-tunes a RoBERTa model to classify text into six different emotions. The project handles dataset class imbalance using weighted loss and includes an interactive web demo built with Gradio and deployed to Hugging Face Spaces.
We use the dair-ai/emotion dataset. It contains English Twitter messages labeled with six basic emotions:
| Label ID | Emotion |
|---|---|
| 0 | Sadness 😢 |
| 1 | Joy 😂 |
| 2 | Love 🥰 |
| 3 | Anger 😡 |
| 4 | Fear 😱 |
| 5 | Surprise 😲 |
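The label order above can be captured in a small mapping, useful when decoding model predictions back to emotion names (a minimal sketch; the dict names `id2label`/`label2id` follow the common Hugging Face config convention):

```python
# Label IDs used by the dair-ai/emotion dataset.
id2label = {0: "sadness", 1: "joy", 2: "love", 3: "anger", 4: "fear", 5: "surprise"}
label2id = {name: i for i, name in id2label.items()}

# Decoding a predicted class ID back to its emotion name:
print(id2label[5])  # surprise
```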
This project goes beyond standard fine-tuning by addressing class imbalance in the training data:
- Data Preprocessing: Tokenization using `RobertaTokenizer` with truncation to a max length of 128.
- Class Weights: We compute class weights using `sklearn.utils.class_weight` to penalize the model more for misclassifying minority classes (like Surprise).
- Custom Trainer: A custom `WeightedTrainer` (subclassing Hugging Face's `Trainer`) is implemented to override the `compute_loss` method, injecting the calculated class weights into the `CrossEntropyLoss`.
- Model: Fine-tuning `roberta-base` for sequence classification.
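The class-weight step can be sketched as follows. This is a minimal illustration using sklearn's `"balanced"` heuristic; the per-class counts below are made up for the example and are not the real distribution of the dataset:

```python
import numpy as np
from sklearn.utils.class_weight import compute_class_weight

# Illustrative (not real) per-class sample counts for the six emotions;
# Surprise (id 5) is the rarest, so it should receive the largest weight.
counts = {0: 5000, 1: 6000, 2: 1500, 3: 2400, 4: 2000, 5: 600}
y = np.concatenate([np.full(n, label) for label, n in counts.items()])

# "balanced" heuristic: weight_c = n_samples / (n_classes * count_c)
weights = compute_class_weight("balanced", classes=np.arange(6), y=y)
print(dict(zip(range(6), weights.round(3))))
```

In the full pipeline, these weights would be converted to a torch tensor and passed as the `weight` argument of `torch.nn.CrossEntropyLoss` inside the `WeightedTrainer.compute_loss` override.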