ChatBot-Personality-Recognition

Inspired by “A neural chatbot with personality” by Nguyen et al. (2017), listed on Semantic Scholar.

In our previous project we created a chatbot that imitates characters from popular TV shows. Our approach improved on the work of Nguyen et al. by using newer transformer-based architectures. We also proposed a more rigorous quantitative analysis of performance using a suite of metrics.

Data

We collected scripts of famous films and TV shows, of varying quality, and pre-processed them to extract well-formatted (line, character) pairs. The main statistics of our dataset are:

Character Name	# Lines	Gained Lines	Show/Movie	# Show Lines
Barney	5194	3.8%	HIMYM	31776
Bender	2388	1.1%	Futurama	15226
Fry	2716	1.4%	Futurama	15226
Harry	1037	27.8%	Harry Potter	4925
Joey	8229	10.5%	Friends	61023
Phoebe	7460	10.2%	Friends	61023
Sheldon	11642	2.5%	TBBT	51268
Vader	160	15.7%	Star Wars	2750
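
The extraction of (line, character) pairs described above can be sketched as follows. This is only an illustration, not the project's actual pre-processing: it assumes a hypothetical script layout with one utterance per row written as "Speaker: text", and the names extract_pairs, SPEAKER_LINE and STAGE_DIRECTION are made up for this example. Real scripts vary in quality and would likely need per-source rules.

```python
import re
from typing import List, Tuple

# Assumed layout (hypothetical): one utterance per row, "Speaker: text".
SPEAKER_LINE = re.compile(r"^\s*([A-Za-z][A-Za-z .'-]*?)\s*:\s*(.+?)\s*$")
# Stage directions such as "(laughs)" or "[door opens]" are stripped out.
STAGE_DIRECTION = re.compile(r"\([^)]*\)|\[[^\]]*\]")


def extract_pairs(script: str) -> List[Tuple[str, str]]:
    """Return (line, character) pairs found in a raw script."""
    pairs = []
    for raw in script.splitlines():
        match = SPEAKER_LINE.match(raw)
        if match is None:
            continue  # scene headings, blank rows, malformed rows
        speaker, text = match.groups()
        text = STAGE_DIRECTION.sub("", text)
        text = " ".join(text.split())  # collapse repeated whitespace
        if text:  # skip rows that contained only stage directions
            pairs.append((text, speaker.strip()))
    return pairs


if __name__ == "__main__":
    sample = "[Scene: the bar]\nBarney: (grinning) Suit up!\nTed:   What?  \n"
    for line, character in extract_pairs(sample):
        print(f"{character}\t{line}")
```

Rows that do not match the assumed layout are silently dropped, which keeps the output well-formatted at the cost of losing some lines from lower-quality scripts.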

Chatbot Model

Architecture: an autoregressive model, DialoGPT, built on top of GPT-2 and fine-tuned independently (with separate weights) for each character, using the HuggingFace 🤗 library and TensorFlow. We tested it with different generation strategies: greedy search, beam search and sampling.
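
As a rough sketch of how the three generation strategies differ, the snippet below maps each one to keyword arguments of the HuggingFace generate method. The numeric values, the generate_reply helper and the microsoft/DialoGPT-small checkpoint are assumptions chosen for illustration; the project's fine-tuned per-character weights and its actual settings are not shown here. It also assumes a transformers release that still ships the TensorFlow model classes.

```python
from typing import Any, Dict

# Keyword arguments for `generate`, one set per decoding strategy.
# The numeric values are illustrative assumptions, not the project's settings.
GENERATION_CONFIGS: Dict[str, Dict[str, Any]] = {
    "greedy": {"do_sample": False, "num_beams": 1},
    "beam_search": {"do_sample": False, "num_beams": 3, "early_stopping": True},
    "sampling": {"do_sample": True, "num_beams": 1, "top_k": 50, "top_p": 0.95},
}


def generate_reply(
    prompt: str,
    strategy: str = "greedy",
    checkpoint: str = "microsoft/DialoGPT-small",  # assumed base checkpoint
    max_new_tokens: int = 40,
) -> str:
    """Generate one reply to `prompt` with the chosen decoding strategy."""
    if strategy not in GENERATION_CONFIGS:
        raise ValueError(f"unknown strategy: {strategy!r}")

    # Imported lazily so the configs can be inspected without loading TensorFlow.
    from transformers import AutoTokenizer, TFAutoModelForCausalLM

    tokenizer = AutoTokenizer.from_pretrained(checkpoint)
    model = TFAutoModelForCausalLM.from_pretrained(checkpoint)

    # DialoGPT expects each dialogue turn to end with the EOS token.
    input_ids = tokenizer.encode(prompt + tokenizer.eos_token, return_tensors="tf")
    output_ids = model.generate(
        input_ids,
        max_new_tokens=max_new_tokens,
        pad_token_id=tokenizer.eos_token_id,
        **GENERATION_CONFIGS[strategy],
    )
    # Keep only the newly generated tokens, dropping the prompt.
    reply_ids = output_ids[0, input_ids.shape[-1]:]
    return tokenizer.decode(reply_ids, skip_special_tokens=True)
```

Greedy search and beam search are deterministic for a given prompt, while sampling can return a different reply on each call, which matters when comparing metrics across runs.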

The task

Our previous project concluded that there is currently no metric capable of fully evaluating a chatbot across multiple aspects (i.e. context alignment, semantic affinity and personality affinity). Finding a sound suite of metrics is still an open problem.

We attempted to introduce some possible metrics (neural and algorithmic) capable of distinguishing the personalities of different characters from text. We argued that the problem is quite hard when the characters come from the same TV show or movie. We tackled this personality recognition task by exploring the concept of semantics from different perspectives and with different approaches, but we think the problem is still open.

New contributions

In this project two new approaches for personality recognition are explored:

a sentence graph based embedding approach (PersGRAPH classifier)
a supervised topic modeling classifier approach (BERTopic classifier)

Setup

Create a virtual environment by running the following command:

python -m venv ./venv

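Activate the virtual environment by executing the activation script contained in the venv folder (for example, source ./venv/bin/activate on Linux/macOS, or .\venv\Scripts\activate on Windows).
Then install the requirements: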

pip install -r ./requirements.txt

Now you are ready to run the Jupyter notebooks in the notebooks folder.
