Whisper GUI

A lightweight alternative to Windows + H (Windows' built-in speech recognition) that offers more flexibility and broader language support using OpenAI's Whisper model.

Overview

This tool is designed to replace the built-in Windows + H speech recognition feature with multilingual speech-to-text transcription. It is particularly useful for users who need accurate transcription in several languages (English, Spanish, and others).

Features

🎤 Simple one-click recording interface
🌍 Multi-language support (English, Spanish, and more)
📝 Real-time transcription
🎯 High accuracy using OpenAI's Whisper model
🖥️ Clean, simple interface

Demo

Here's a screenshot of the application interface:

Installation

Option 1: Download Pre-built Executable

Go to the Releases page
Download the latest version of whisper-gui.exe
Run the executable

Option 2: Build from Source

Clone the repository:

git clone https://github.com/elpargo/whisper-windows-gui.git
cd whisper-windows-gui

Create and activate a virtual environment:

python -m venv venv
.\venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```
Build the executable:
```
.\build.ps1
```

Usage

Run the application
Click the microphone button (or press Space/Enter) to start recording
Speak in your desired language
Click the button again (or press Space/Enter) to stop recording
The transcription will appear in the text area
Click "Save" to save the transcription to a file

Note: The Space and Enter keys are interchangeable for starting and stopping a recording.
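
The start/stop behavior described above can be pictured as a two-state toggle. The following is a minimal sketch, not the application's actual code: the names `RecorderState`, `RecordingToggle`, and `TOGGLE_KEYS` are hypothetical, and the key names are plain strings rather than PyQt6 key codes.

```python
from dataclasses import dataclass
from enum import Enum, auto


class RecorderState(Enum):
    IDLE = auto()
    RECORDING = auto()


# Keys that toggle recording. Illustrative strings, not PyQt6 key codes.
TOGGLE_KEYS = {"space", "enter"}


@dataclass
class RecordingToggle:
    """Tracks whether recording is active and counts finished recordings."""

    state: RecorderState = RecorderState.IDLE
    completed_takes: int = 0

    def toggle(self) -> RecorderState:
        """Flip between IDLE and RECORDING, like a click on the microphone button."""
        if self.state is RecorderState.IDLE:
            self.state = RecorderState.RECORDING
        else:
            self.state = RecorderState.IDLE
            self.completed_takes += 1  # a stop marks one finished recording
        return self.state

    def handle_key(self, key_name: str) -> RecorderState:
        """Toggle on Space or Enter and ignore every other key."""
        if key_name.lower() in TOGGLE_KEYS:
            return self.toggle()
        return self.state
```

In a real GUI, the same `toggle` method would presumably be wired to both the button click and the key press handler, so the two input paths cannot drift apart.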

Why Whisper GUI?

While Windows + H provides basic speech recognition, it has limitations:

Limited language support
Requires internet connection
Less accurate for non-English languages
No easy way to save transcriptions

Whisper GUI addresses these issues by:

Supporting multiple languages
Working offline
Providing higher accuracy
Offering easy transcription saving

Technical Details

Built with Python and PyQt6
Uses OpenAI's Whisper model for transcription
Compiled with PyInstaller for easy distribution
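
For readers unfamiliar with Whisper, the sketch below shows roughly how a recorded file can be transcribed with the `openai-whisper` Python package. It is an illustration under stated assumptions, not this project's implementation: the helper names `build_decode_options` and `transcribe_file` are hypothetical, the `"base"` model size is an arbitrary choice, and CPU inference is assumed (hence `fp16=False`). Whisper also needs `ffmpeg` available to read audio files.

```python
from typing import Optional


def build_decode_options(language: Optional[str] = None) -> dict:
    """Return keyword options for Whisper's transcribe call.

    Leaving language as None keeps it unset, so Whisper detects it itself.
    """
    options = {"fp16": False}  # assumption: CPU inference, which uses fp32
    if language is not None:
        options["language"] = language  # e.g. "en" or "es"
    return options


def transcribe_file(audio_path: str, language: Optional[str] = None,
                    model_name: str = "base") -> str:
    """Transcribe an audio file and return the recognized text."""
    import whisper  # imported lazily; requires `pip install openai-whisper`

    model = whisper.load_model(model_name)  # fetches the weights on first use
    result = model.transcribe(audio_path, **build_decode_options(language))
    return result["text"].strip()
```

A call such as `transcribe_file("recording.wav", language="es")` (with a hypothetical file name) would return the Spanish transcription as a string. Once the model weights are cached locally, no network connection is needed.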

License

This project is released under the MIT License. See the LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Acknowledgments

OpenAI Whisper for the speech recognition model
PyQt6 for the GUI framework

TODOs

💾 Save transcriptions to text files
Invoke via a global OS keybinding (i.e., replace Windows + H entirely)
Output directly to the focused text input field (also a Windows + H feature)

