
Toxicity-Classification

Description:

This project detects and classifies hate speech into six categories: obscene, threatening, insulting, toxic, severely toxic, and identity hate. It uses machine learning models such as SVM, logistic regression, extra trees, XGBoost, and LSTM, addresses the challenges of multilabel classification, and incorporates classifier chains to improve performance. Of the models listed, XGBoost performs best (see the sketch below).
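The binary-relevance and classifier-chain setups mentioned above can be sketched with scikit-learn roughly as follows. This is a minimal illustration, not the repository's actual code: the CSV path, column names, and TF-IDF settings are assumptions (they follow the Jigsaw toxic-comment dataset layout).

```python
# Sketch: binary relevance vs. classifier chains for multilabel toxicity classification.
# Assumed inputs: a Jigsaw-style train.csv with a "comment_text" column and one
# binary column per label.
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.multiclass import OneVsRestClassifier
from sklearn.multioutput import ClassifierChain

LABELS = ["toxic", "severe_toxic", "obscene", "threat", "insult", "identity_hate"]

df = pd.read_csv("train.csv")                      # assumed file name
X_text, y = df["comment_text"], df[LABELS].values  # assumed column names

X_train, X_test, y_train, y_test = train_test_split(
    X_text, y, test_size=0.2, random_state=42
)

# TF-IDF features over word unigrams and bigrams (illustrative settings).
vectorizer = TfidfVectorizer(max_features=50_000, ngram_range=(1, 2))
X_train_vec = vectorizer.fit_transform(X_train)
X_test_vec = vectorizer.transform(X_test)

base = LogisticRegression(max_iter=1000)

# Binary relevance: one independent binary classifier per label.
binary_relevance = OneVsRestClassifier(base).fit(X_train_vec, y_train)

# Classifier chain: each label's classifier also sees the previous labels'
# predictions, so correlations between labels (e.g. toxic -> obscene) can help.
chain = ClassifierChain(base, order="random", random_state=42).fit(X_train_vec, y_train)
```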

Output:

| Model | Mean AUC-ROC score |
| --- | --- |
| SVM (Binary Relevance) | 0.66 |
| SVM (Classifier Chains) | 0.67 |
| Logistic Regression (Binary Relevance) | 0.73 |
| Logistic Regression (Classifier Chains) | 0.76 |
| Extra Trees | 0.93 |
| XGBoost | 0.96 |
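A mean AUC-ROC score like those in the table can be computed by averaging the per-label ROC AUC values. The sketch below reuses the hypothetical `chain` model and test split from the Description section and is only an illustration of the metric, not the repository's evaluation code.

```python
# Sketch: mean AUC-ROC over the six labels.
import numpy as np
from sklearn.metrics import roc_auc_score

proba = chain.predict_proba(X_test_vec)            # shape: (n_samples, n_labels)
per_label_auc = [
    roc_auc_score(y_test[:, i], proba[:, i]) for i in range(y_test.shape[1])
]
print("Mean AUC-ROC:", np.mean(per_label_auc))
```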

Dataset and other files:

https://drive.google.com/drive/folders/1kooEeZ5QE3eteVic6QINXOakli5VIBYZ?usp=sharing

UI:

Home screen: (screenshot)

Output screen for the example input "damn you idiot!": (screenshot)
