usefulsensors / openai-whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆72Updated last year
Alternatives and similar repositories for openai-whisper:
Users that are interested in openai-whisper are comparing it to the libraries listed below
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices☆203Updated 5 months ago
- Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android☆318Updated this week
- A python package to build AI-powered real-time audio applications☆1,153Updated 6 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆108Updated last year
- Streaming transcriber with whisper☆686Updated last year
- openvino version of openai/whisper☆164Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆238Updated 2 years ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆297Updated 2 months ago
- Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.☆287Updated 3 weeks ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆352Updated 5 months ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆65Updated last year
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆471Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆91Updated 8 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆58Updated last year
- Speech-to-text server framework with next-gen Kaldi☆594Updated this week
- ☆487Updated 6 months ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆276Updated last year
- Efficient Inference of Transformer models☆414Updated 5 months ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2.☆970Updated 3 weeks ago
- A voice to text keyboard based on OpenAI Whisper Model.☆50Updated last year
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆574Updated last year
- How to use OpenAIs Whisper to transcribe and diarize audio files☆320Updated 2 years ago
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆804Updated 9 months ago
- Pybind11 bindings for Whisper.cpp☆328Updated last month
- Whisper realtime streaming for long speech-to-text transcription and translation☆2,367Updated 3 weeks ago
- A nearly-live implementation of OpenAI's Whisper.☆2,334Updated last week
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆203Updated 2 months ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆345Updated last year
- ☆548Updated 8 months ago
- A quick experiment to achieve almost realtime transcription using Whisper.☆187Updated 2 years ago