PitchBench

A simple experimental benchmark to test the ability of language models to retrieve pitch accent patterns for Japanese words.

Versions

V1: A proof-of-concept benchmark using a non-optimized, manually added medical and misc vocabulary. Results are scored out of 30.
V2: An improved benchmark using a random sample of words that follows Zipf’s Law in the BCCWJ SUW LUW Combined word frequency dictionary (originally for Yomitan). This benchmark should much more accurately evaluate a model's ability to predict pitch accent in natural Japanese text. Results are reported as a percentage, based on the success-rate of 50 words.

Setup

Clone the repository:
Install the required packages:
```
pip install -r requirements.txt
```
Set up your OpenRouter API key in the .env file based on the template.
Run the benchmark:
```
cd v2 # or cd v1
python main.py
```

Methodology

V1

In a prompt, 30 japanese words are provided, and the model is asked to return their pitch accent patterns in Tokyo Standard Japanese using specific labels (H, A, N2, N3, ..., O). The results are compared against a predefined solution to calculate scores and token usage:

1 point is awarded for each correct pitch accent pattern.
0.5 is awarded if the model guessed a correct pattern within multiple possible answers.
0 is awarded if the model makes a wrong guess or guesses too many accents for a single word.

V2

Same as above, except:

50 words are provided.
1 point is awarded regardless of if multiple pitch accents were valid.
Pitch accent patterns are simplified to H, A, N, and O, and a model only provides one per word.

Results

V2

V1

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
v1		v1
v2		v2
.env.template		.env.template
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PitchBench

Versions

Setup

Methodology

Results

About

Uh oh!

Releases

Packages

Languages

Shewiiii/PitchBench

Folders and files

Latest commit

History

Repository files navigation

PitchBench

Versions

Setup

Methodology

Results

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages