StudyMate is a Bangla extractive question-answering application tailored for students. It compares models trained on Bangla datasets and deploys them to deliver precise answers quickly, supporting academic learning and performance.
- Source: BanglaRQA on Hugging Face
- Source: SQuAD_bn on Hugging Face
A filtered version of the BanglaRQA dataset, containing only factoid and confirmation-type questions, was converted to the SQuAD_bn dataset format.
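The filtering-and-conversion step described above could be sketched as below. The field names (`question_type`, `context`, `answers`, etc.) are illustrative assumptions, not the actual BanglaRQA schema; the output follows the SQuAD-style layout used by Hugging Face QA examples.

```python
# Hypothetical sketch: keep factoid/confirmation questions from a
# BanglaRQA-style record list and emit SQuAD-format entries.
# All field names here are assumptions about the schema.

KEPT_TYPES = {"factoid", "confirmation"}

def to_squad_format(records):
    """Filter by question type and restructure into SQuAD-style dicts."""
    out = []
    for rec in records:
        if rec["question_type"] not in KEPT_TYPES:
            continue
        answers = rec.get("answers", [])
        out.append({
            "context": rec["context"],
            "question": rec["question"],
            "answers": {
                "text": [a["text"] for a in answers],
                "answer_start": [a["start"] for a in answers],
            },
            # SQuAD v2 convention: an empty answer list marks the
            # question as unanswerable from the context.
            "is_impossible": len(answers) == 0,
        })
    return out
```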
Four BERT-based models (XLM-RoBERTa, mBERT, BanglaBERT, and IndicBERT) were fine-tuned on the two datasets, producing eight models in total; their results are reported below.
The code for training the models is provided in the repository.
To run the application, run the notebook here.
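At inference time, an extractive QA model answers by selecting a span of the context: it scores every token as a possible answer start and end, and the highest-scoring valid pair wins. A minimal, self-contained sketch of that span-selection step (the scores below are made-up toy values, not real model output):

```python
# Span selection for extractive QA: pick the (start, end) pair with the
# highest combined score, subject to start <= end and a length cap.

def best_span(start_scores, end_scores, max_len=15):
    best = (0, 0)
    best_score = float("-inf")
    for s, s_score in enumerate(start_scores):
        for e in range(s, min(s + max_len, len(end_scores))):
            score = s_score + end_scores[e]
            if score > best_score:
                best_score, best = score, (s, e)
    return best

# Toy example: four context tokens with hand-picked scores.
tokens = ["ঢাকা", "বাংলাদেশের", "রাজধানী", "।"]
start = [2.0, 0.1, -1.0, -2.0]
end = [1.5, 0.2, 0.3, -1.0]
s, e = best_span(start, end)
print(" ".join(tokens[s:e + 1]))  # → ঢাকা
```

In the real models these scores come from the QA head's start/end logits over the tokenized context; the selection logic is the same.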
| Model | HasAns Count | HasAns Exact (%) | HasAns F1 (%) | NoAns Count | NoAns Exact (%) | NoAns F1 (%) | Overall Exact (%) | Overall F1 (%) |
|---|---|---|---|---|---|---|---|---|
| BanglaBERT | 625 | 15.52 | 27.29 | 575 | 0.00 | 0.00 | 8.08 | 14.22 |
| IndicBERT | 625 | 12.80 | 27.61 | 575 | 0.17 | 0.17 | 6.75 | 14.46 |
| XLM-RoBERTa | 625 | 45.28 | 60.13 | 575 | 0.00 | 0.00 | 23.58 | 31.31 |
| mBERT | 625 | 46.88 | 61.26 | 575 | 1.21 | 1.21 | 25.00 | 32.49 |

Table: Quantitative Evaluation of Various Models on the SQuAD_bn Dataset
| Model | HasAns Count | HasAns Exact (%) | HasAns F1 (%) | NoAns Count | NoAns Exact (%) | NoAns F1 (%) | Overall Exact (%) | Overall F1 (%) |
|---|---|---|---|---|---|---|---|---|
| BanglaBERT | 868 | 26.84 | 41.91 | 314 | 0.00 | 0.00 | 19.71 | 30.77 |
| IndicBERT | 868 | 13.94 | 33.16 | 314 | 2.23 | 2.23 | 10.83 | 24.94 |
| XLM-RoBERTa | 868 | 64.98 | 81.53 | 314 | 0.64 | 0.64 | 47.89 | 60.04 |
| mBERT | 868 | 63.13 | 80.04 | 314 | 0.00 | 0.00 | 46.36 | 58.78 |

Table: Quantitative Evaluation of Various Models on the BanglaRQA Dataset
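The Exact and F1 columns above follow the standard SQuAD-style metrics: Exact checks for a verbatim match against the gold answer, while F1 measures token-level overlap. A simplified sketch (whitespace tokenization only; the official SQuAD script also normalizes punctuation and articles):

```python
# Simplified SQuAD-style metrics: exact match and token-level F1.

from collections import Counter

def exact(pred, gold):
    """1 if the prediction matches the gold answer verbatim, else 0."""
    return int(pred.strip() == gold.strip())

def f1(pred, gold):
    """Harmonic mean of token precision and recall between pred and gold."""
    p, g = pred.split(), gold.split()
    if not p or not g:
        # Both empty counts as a match (the NoAns case); otherwise 0.
        return float(p == g)
    common = sum((Counter(p) & Counter(g)).values())
    if common == 0:
        return 0.0
    precision = common / len(p)
    recall = common / len(g)
    return 2 * precision * recall / (precision + recall)
```

Per-example scores are averaged over the HasAns and NoAns subsets (and over the full set) to produce the percentages in the tables.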
| Model | Training Loss | Evaluation Loss |
|---|---|---|
| BanglaBERT | 1.82 | 2.26 |
| IndicBERT | 2.79 | 2.74 |
| XLM-RoBERTa | 1.17 | 1.39 |
| mBERT | 0.88 | 1.46 |

Table: Training and Evaluation Loss of Various Models on the SQuAD_bn Dataset
| Model | Training Loss | Evaluation Loss |
|---|---|---|
| BanglaBERT | 1.07 | 1.41 |
| IndicBERT | 1.40 | 1.44 |
| XLM-RoBERTa | 0.59 | 0.73 |
| mBERT | 0.34 | 0.64 |

Table: Training and Evaluation Loss of Various Models on the BanglaRQA Dataset
The loss curves are provided in the folder here.