Dockerization of an API service based on Faster-Whisper for audio transcription using OpenAI's Whisper model and CUDA.

zakantonio/whisper-service

Whisper Service

An API service based on Faster-Whisper for audio transcription using OpenAI's Whisper model.

Features

  • REST API for audio file transcription
  • CUDA support for GPU acceleration
  • Containerized with Docker for easy deployment
  • Error handling and reconnection attempts

Requirements

  • Docker
  • NVIDIA GPU with CUDA support (optional, but recommended for better performance)
  • NVIDIA Container Toolkit (to use GPU in Docker)

Installation

With Docker

docker-compose build --no-cache
docker-compose up -d --force-recreate

Testing the service

python examples/simple_client.py path/to/your/audio.wav

Usage

Send a POST request to the /transcribe endpoint with an audio file:

import requests

url = "http://localhost:8421/transcribe"
# Use a context manager so the file handle is closed after the upload
with open('audio.wav', 'rb') as f:
    response = requests.post(url, files={'file': f})
response.raise_for_status()  # fail loudly on HTTP errors
print(response.json())
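Since the service advertises error handling and reconnection attempts, a client may still want its own retries while the container is starting or restarting. A minimal client-side sketch, assuming you simply want to re-send the whole request on connection errors; the helper names `with_retries` and `transcribe` are illustrative and not part of the service:

```python
import time

import requests


def with_retries(func, attempts=3, delay=1.0):
    """Call func(), retrying up to `attempts` times on connection errors."""
    for i in range(attempts):
        try:
            return func()
        except requests.ConnectionError:
            if i == attempts - 1:
                raise  # out of attempts: propagate the last error
            time.sleep(delay)


def transcribe(path, url="http://localhost:8421/transcribe"):
    def attempt():
        # Reopen the file on each try so a failed upload does not
        # leave the stream half-consumed.
        with open(path, "rb") as f:
            resp = requests.post(url, files={"file": f})
        resp.raise_for_status()
        return resp.json()

    return with_retries(attempt)
```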

Configuration

The service listens on port 8421 by default. You can change the port in the server.py file.
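If you deploy with Docker Compose, the host-side port can also be remapped in docker-compose.yml without touching server.py. A sketch under the assumption that the service is named `whisper-service` in your compose file (the actual name may differ):

```yaml
services:
  whisper-service:
    ports:
      # host:container — change the left-hand side to expose a
      # different port on the host; 8421 inside the container must
      # match the port configured in server.py
      - "8421:8421"
```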
