
harpertoken/benchmark

speech model benchmark.

files

  • harpertoken/ – core modules
  • tests/ – test scripts
  • scripts/ – utilities
  • main.py – entry point
  • requirements.txt – dependencies

usage

whisper model

from harpertoken.model import SpeechModel
from harpertoken.dataset import LiveSpeechDataset
from transformers import WhisperProcessor
import torch

# use tiny model for faster testing (or whisper-small for better quality)
model = SpeechModel(model_type="whisper")
processor = WhisperProcessor.from_pretrained("openai/whisper-tiny")

dataset = LiveSpeechDataset()
audio = dataset.record_audio()

# process audio
inputs = processor(audio, sampling_rate=16000, return_tensors="pt")
# attention mask over the padded feature frames (last dimension), not the mel bins
attention_mask = torch.ones(
    inputs.input_features.shape[0],
    inputs.input_features.shape[-1],
    dtype=torch.long,
)

# generate transcription
with torch.no_grad():
    generated_ids = model.generate(
        input_features=inputs.input_features,
        attention_mask=attention_mask,
        language="en",
        task="transcribe",
    )

# decode transcription
transcription = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(f"transcription: {transcription}")

wav2vec2 model

from harpertoken.model import SpeechModel
from harpertoken.dataset import LiveSpeechDataset
from transformers import Wav2Vec2FeatureExtractor
import torch

model = SpeechModel(model_type="wav2vec2")
processor = Wav2Vec2FeatureExtractor.from_pretrained("facebook/wav2vec2-base-960h")

dataset = LiveSpeechDataset()
audio = dataset.record_audio()

inputs = processor(audio, sampling_rate=16000, return_tensors="pt")

with torch.no_grad():
    features = model(inputs.input_values)

print(features.shape)

training

from harpertoken.train import train_model

# train the model (whisper or wav2vec2)
train_model(model_type="whisper")  # or "wav2vec2"

testing

see docs/TESTING.md for detailed testing instructions.

quick test

# activate virtual environment
source venv/bin/activate

# run all tests
python run_tests.py

# run individual tests
python -m unittest tests.test_unit
python tests/test_transcription.py --model_type whisper

programmatic testing

# unit tests
from harpertoken.model import SpeechModel
from harpertoken.dataset import LiveSpeechDataset
from harpertoken.evaluate import compute_metrics

# test model creation
model = SpeechModel(model_type="whisper")
dataset = LiveSpeechDataset()

# test evaluation
predictions = ["hello", "world"]
labels = ["hello", "world"]
wer, cer = compute_metrics(predictions, labels)
print(f"wer: {wer}, cer: {cer}")

# live transcription test
from tests.test_transcription import test_transcription
test_transcription(model_type="whisper")

docker

build the docker image:

docker build -t benchmark .

run the container:

docker run benchmark
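
live recording with LiveSpeechDataset needs microphone access inside the container; one possible approach on a Linux host (an assumption, adjust to your audio setup) is to pass the ALSA devices through:

docker run --device /dev/snd benchmark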

versioning

this project uses semantic versioning with automated releases via semantic-release.

versions are automatically bumped and tagged based on conventional commit messages:

  • feat: commits trigger minor version bumps
  • fix: commits trigger patch version bumps
  • BREAKING CHANGE in commit messages triggers major version bumps

releases are created on pushes to the main branch. check the releases page for version history.
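
for example, with hypothetical commit messages (not from this repository), starting from version 1.2.3:

git commit -m "fix: handle empty audio buffer"     # patch release -> 1.2.4
git commit -m "feat: add batch transcription"      # minor release -> 1.3.0
git commit -m "feat: rework model loading" -m "BREAKING CHANGE: SpeechModel constructor changed"   # major release -> 2.0.0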

dependencies

this project uses:

  • requirements.txt - minimum version constraints for flexibility
  • requirements-lock.txt - exact versions tested across all platforms
  • dependabot - automated dependency updates via pull requests

dependabot creates weekly prs to update dependencies, grouped by category (pytorch, transformers, audio, etc.) and tested across python 3.8-3.12 on ubuntu, macos, and windows.
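
for example, to set up a local environment from either file (a minimal sketch; the venv name matches the quick test section):

python -m venv venv
source venv/bin/activate
pip install -r requirements.txt        # flexible minimum versions
# or, for the exact versions tested in ci:
pip install -r requirements-lock.txt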

ci workflow

the ci workflow runs automated tests, linting, and formatting checks on every push and pull request to the main branch using github actions. it includes:

  • unit tests across python 3.8-3.12
  • ruff linting (including unused code detection)
  • ruff formatting checks
  • docker image build and test

note: local git hooks can enforce checks before commit/push; see docs/TESTING.md for setup.
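
the same checks can be run locally before pushing (assuming ruff is installed in your environment):

ruff check .            # linting, including unused code detection
ruff format --check .   # formatting check
python run_tests.py     # unit tests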
