# BiAtt-HateXplain

This algorithm is derived from the BiRNN-HateXplain algorithm, and this project is based on the HateXplain project (https://github.com/hate-alert/HateXplain/tree/master; associated article: https://arxiv.org/pdf/2012.10289). The results of our studies can be found in the `models_and_results/BiAtt_BiRNN_max_2` folder.

The notebook `example_hatexplain_with_BiRNN-HateXplain_vs_BiAtt_BiRNN-HateXplain.ipynb` contains examples comparing the ground-truth attention with the attention predicted by BiRNN-HateXplain and by BiAtt-BiRNN-HateXplain.

To train a proposed model, use the notebook `Example_HateExplain.ipynb`.

## Objective

The objective of this project is to improve the results of the BiRNN-HateXplain and BERT-HateXplain algorithms in terms of detection performance, unintentional bias, and explainability.

## Problem with current approaches

In current algorithms such as BiRNN-HateXplain, the estimated attention varies widely over spans where it should be constant.

*(Figure: ground-truth attention vs. attention estimated by BiRNN-HateXplain.)*

For example, over the interval [8, 20] of the plot above, the attention estimated by the BiRNN-HateXplain model fluctuates strongly even though the ground-truth attention is constant there.
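This fluctuation can be quantified directly. The sketch below is not part of the repository and its arrays are made up for illustration; it measures the total variation of an attention vector over a token span, which should be near zero wherever the ground-truth attention is constant.

```python
import numpy as np

def total_variation(attention, start, end):
    """Sum of absolute differences between consecutive attention
    weights on the token span [start, end] (inclusive)."""
    span = np.asarray(attention[start : end + 1])
    return float(np.abs(np.diff(span)).sum())

# Hypothetical attention vectors over a 25-token post: the ground
# truth is constant on [8, 20], the "predicted" one is noisy there.
ground_truth = np.zeros(25)
ground_truth[8:21] = 1.0 / 13.0
predicted = ground_truth + np.random.default_rng(0).normal(0.0, 0.02, 25)

print(total_variation(ground_truth, 8, 20))  # 0.0
print(total_variation(predicted, 8, 20))     # > 0: the fluctuation seen in the plot
```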

## Proposal

Our hypothesis is that taking the sequential nature of the input data into account in HateXplain models can remove this attention variability and improve explainability. Because these models are trained with multi-task learning (a classification task and an explainability task), better attention should in turn improve classification performance and reduce the unintentional bias toward the communities targeted by hate speech.
*(Figure: overview of the proposed approach.)*
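As a rough illustration of the multi-task objective mentioned above, the following sketch combines a classification loss with an attention-supervision loss. It is a minimal PyTorch example, not the repository's actual training code; the loss weight `lam` and the use of a KL term for attention supervision are assumptions.

```python
import torch
import torch.nn.functional as F

def multitask_loss(class_logits, class_labels, attn_pred, attn_truth, lam=1.0):
    # Task 1: label classification (e.g. hatespeech / offensive / normal).
    cls_loss = F.cross_entropy(class_logits, class_labels)
    # Task 2: explainability -- pull the predicted attention distribution
    # toward the ground-truth rationale distribution over tokens.
    attn_loss = F.kl_div(attn_pred.clamp_min(1e-8).log(), attn_truth,
                         reduction="batchmean")
    return cls_loss + lam * attn_loss

# Toy shapes: batch of 4 posts, 25 tokens each, 3 classes.
logits = torch.randn(4, 3)
labels = torch.randint(0, 3, (4,))
attn_pred = torch.softmax(torch.randn(4, 25), dim=-1)
attn_truth = torch.softmax(torch.randn(4, 25), dim=-1)
print(multitask_loss(logits, labels, attn_pred, attn_truth))
```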

## Results

The results show that the proposed approach improves explainability, prediction performance, and the metrics that measure the model's unintentional bias. We also observed that the proposed approach estimates constant attention where it should indeed be constant.

*(Figure: attention of the proposed model vs. BiRNN-HateXplain on the same post.)*

In the figure above, over the interval [8, 20], the attention of the proposed model stays constant where it should, unlike that of BiRNN-HateXplain.

## Installation

We recommend using a tool such as conda to create a virtual environment and ease dependency-conflict management. Then install the packages listed in the `requirements.txt` file.
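A typical setup might look like the following (the environment name and Python version are arbitrary choices, not prescribed by the repository):

```bash
conda create -n biatt-hatexplain python=3.8   # any recent Python should do
conda activate biatt-hatexplain
pip install -r requirements.txt
```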

## About

Take the data aspect into account in the explainability of black-box models with ground truth.
