This repository contains the main files for the research entitled "LLM-Based Automatic Generation of Multiple-Choice Questions With Meaningful Distractors". Most of the code uses Langfuse to manage prompt versions and the entire pipeline of our experiment, which is why some code calls this framework; however, the code presented here is enough to replicate our results without using Langfuse.
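
For readers who want to run the code without a Langfuse account, here is a minimal sketch of the pattern being described: fetch a versioned prompt from Langfuse when it is configured, otherwise fall back to a local file. The prompt name `distractor-generation` and the `.txt` layout under `Prompts/` are illustrative assumptions, not the repository's actual identifiers:

```python
# Hypothetical sketch: load a versioned prompt from Langfuse when available,
# falling back to a local file so the pipeline also runs without Langfuse.
from pathlib import Path

def load_prompt(name: str) -> str:
    try:
        from langfuse import Langfuse  # optional dependency
        client = Langfuse()  # reads LANGFUSE_* credentials from the environment
        return client.get_prompt(name).prompt  # text of the managed prompt
    except Exception:
        # Fallback: read the same prompt from the local Prompts folder
        # (assumed here to hold one .txt file per prompt).
        return Path(f"Prompts/{name}.txt").read_text(encoding="utf-8")

prompt_template = load_prompt("distractor-generation")
```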
To summarize the important files:
- The `Result` folder contains the computed results on our test evaluation dataset:
  - `maritaca-sabia3-BF.csv`
  - `maritaca-sabia3-DT.csv`
  - `openai_gpt4o-mini-BF.csv`
  - `openai_gpt4o-mini-DT.csv`
- Notebooks:
  - `Pipeline`: runs the generation pipeline that creates the distractors (a hedged sketch of this step follows the list).
  - `Evaluators`: computes the evaluation metrics (diversity and LLM-as-a-judge) for the generated distractors (see the metric sketch after the list).
  - `Result_analysis`: a brief analysis, with code, of how we extract the metrics (see the loading sketch after the list).
- Prompts: a collection of prompt files used in the research.
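
As a rough illustration of what the `Pipeline` notebook does, the sketch below asks an LLM for distractors through the OpenAI SDK. The model name `gpt-4o-mini` matches the result files above, but the function and the prompt wording are illustrative assumptions, not the exact code from the notebook:

```python
# Hypothetical sketch of the generation step: ask an LLM for plausible but
# incorrect answer options, given a question and its correct answer.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def generate_distractors(question: str, answer: str, n: int = 3) -> str:
    response = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{
            "role": "user",
            "content": (
                f"Question: {question}\nCorrect answer: {answer}\n"
                f"Write {n} plausible but incorrect answer options."
            ),
        }],
    )
    return response.choices[0].message.content
```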
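
For the diversity side of the `Evaluators` notebook, one possible lexical diversity score is sketched below: one minus the mean pairwise Jaccard similarity of the distractors' token sets. This conveys the flavor of the metric and is not the paper's exact formulation; the LLM-as-a-judge metric instead prompts a model to rate each generated distractor.

```python
# Minimal sketch of a lexical diversity score over a set of distractors:
# 1 - mean pairwise Jaccard similarity of their token sets.
from itertools import combinations

def jaccard(a: set[str], b: set[str]) -> float:
    return len(a & b) / len(a | b) if a | b else 0.0

def diversity(distractors: list[str]) -> float:
    token_sets = [set(d.lower().split()) for d in distractors]
    pairs = list(combinations(token_sets, 2))
    if not pairs:  # fewer than two distractors: no pairwise comparison
        return 0.0
    return 1.0 - sum(jaccard(a, b) for a, b in pairs) / len(pairs)

print(diversity(["the mitochondria", "the cell nucleus", "the ribosome"]))
```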
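
Finally, in the spirit of `Result_analysis`, the result CSVs can be loaded and aggregated along the lines below. The column names (and therefore the means that come out) are assumptions; check the notebook for the actual schema:

```python
# Sketch: load each result CSV and print the mean of its numeric columns,
# to compare models (sabia-3 vs. gpt-4o-mini) and strategies (BF vs. DT).
import pandas as pd

files = {
    "sabia3-BF": "Result/maritaca-sabia3-BF.csv",
    "sabia3-DT": "Result/maritaca-sabia3-DT.csv",
    "gpt4o-mini-BF": "Result/openai_gpt4o-mini-BF.csv",
    "gpt4o-mini-DT": "Result/openai_gpt4o-mini-DT.csv",
}
for label, path in files.items():
    df = pd.read_csv(path)
    print(label, df.mean(numeric_only=True))
```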