Skip to content

dlazesz/n-gram-benchmark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

N-gram Creation Methods Benchmark

This is a sample program to test speed and memory usage for different approaches for n-gram creation in Python

Usage

Create virtual environment and install requirements

make venv

Running the tests

make run

This command will do the following:

  1. Create a file of random length lorem ipsum sentences in one sentence per line (SPL) format
  2. Tests character and word 3-grams and 5-grams and print the output

See results in results.txt

Acknowledgment

This program could not be created without the joint efforts of Iván Mittelholcz, Bálint Sass, Bence Nyéki and Balázs Indig

License

This code is licensed under the MIT license

About

N-gram Creation Methods Benchmark

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •