usefulsensors / openai-whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆69Updated last year
Related projects ⓘ
Alternatives and complementary repositories for openai-whisper
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices☆178Updated 2 months ago
- Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android☆241Updated last week
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆275Updated last week
- A python package to build AI-powered real-time audio applications☆1,090Updated 4 months ago
- Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.☆256Updated 9 months ago
- Streaming transcriber with whisper☆684Updated last year
- openvino version of openai/whisper☆161Updated last year
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Updated last year
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆444Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆237Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆105Updated last year
- A live speech recognition using Facebooks wav2vec 2.0 model.☆328Updated 9 months ago
- ☆459Updated 4 months ago
- Efficient Inference of Transformer models☆391Updated 3 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆57Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆114Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆259Updated last year
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆786Updated 6 months ago
- Pybind11 bindings for Whisper.cpp☆325Updated this week
- A voice to text keyboard based on OpenAI Whisper Model.☆49Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆312Updated 2 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆730Updated this week
- Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRCha…☆401Updated 3 weeks ago
- A quick experiment to achieve almost realtime transcription using Whisper.☆186Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆179Updated this week
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone☆911Updated 2 weeks ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆331Updated 10 months ago
- A nearly-live implementation of OpenAI's Whisper.☆2,060Updated 2 weeks ago
- ☆234Updated last year
- streaming speech to text server using Whisper☆83Updated last year