Find rhyming words in Spanish.
Already prepared in the repo, but in case I forget:
Frequency list: RAE Corpus, ./CREA_total.TXT. Note that the list is ISO-8859-1 encoded; it has to be converted to UTF-8 before use.
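In case the conversion ever has to be redone, here is a minimal sketch using only the Rust standard library. It assumes the file really is strict ISO-8859-1, as noted above; if it turned out to be Windows-1252, bytes 0x80 to 0x9F would need a proper decoder. The function names and the output path are hypothetical, not part of this repo.

```rust
use std::fs;
use std::io;
use std::path::Path;

/// Decode ISO-8859-1 bytes into a String.
/// Every ISO-8859-1 byte has the same value as its Unicode code point,
/// so a byte-to-char cast is a complete decoder for this encoding.
fn latin1_to_string(bytes: &[u8]) -> String {
    bytes.iter().map(|&b| b as char).collect()
}

/// Read an ISO-8859-1 file and write its contents back out as UTF-8.
/// Both paths are chosen by the caller (hypothetical helper).
fn convert_file(input: &Path, output: &Path) -> io::Result<()> {
    let raw = fs::read(input)?;
    // A Rust String is always UTF-8, so writing it produces UTF-8 bytes.
    fs::write(output, latin1_to_string(&raw))
}
```

Calling `convert_file` with ./CREA_total.TXT as input and any output path of your choice would produce the UTF-8 copy.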
Dictionary: hunspell dictionaries from https://github.com/wooorm/dictionaries, expanded into a word list with unmunch, ./words.txt.
# Initialize word frequency map
cargo run --release --bin init
# Start the console app
cargo run --release --bin query
# Start the HTTP API
cargo run --release --bin serve
- Order by number of syllables
- Order by frequency
- Ability to choose dictionary
- Speed
- Metric syllables
- Assonant rhymes
- Homophonous consonants, namely ll and y
Plus the assorted features available at https://buscapalabras.com.ar/rimas.php
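Two of the items in the list above, ordering results and treating ll and y as the same sound, could be sketched roughly as follows. This is only an illustration: the `Candidate` struct and both function names are hypothetical and do not reflect the actual types in this repo, and the ll/y folding is a deliberately naive string replacement that expects lowercase input.

```rust
/// Hypothetical candidate record; the real types in this repo may differ.
#[derive(Debug, Clone, PartialEq)]
struct Candidate {
    word: String,
    syllables: usize,
    frequency: u64,
}

/// Fold `ll` into `y` so that words spelled with either consonant
/// produce the same key when compared for rhyme.
fn fold_homophones(word: &str) -> String {
    word.replace("ll", "y")
}

/// Sort by syllable count (ascending), then by frequency (descending),
/// so short and common words come first.
fn sort_candidates(candidates: &mut [Candidate]) {
    candidates.sort_by(|a, b| {
        a.syllables
            .cmp(&b.syllables)
            .then_with(|| b.frequency.cmp(&a.frequency))
    });
}
```

The folded form would be used only as a comparison key; the original spelling is what gets shown to the user.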
The query binary takes 100 ms to be ready. Its memory footprint stays below 50 MB.
- syllabize-es crate, my own dog food.
- serde with bincode, good stuff.
Dunno. The dictionaries I used are GPL, but they are just data. Is it OK to use a more permissive license for the code?