usefulsensors / openai-whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆65Updated last year
Related projects:
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices☆148Updated 3 weeks ago
- Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android☆194Updated 2 months ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆62Updated last year
- Real-Time Whisper Voice Recognition with Vosk model feedback.☆103Updated last year
- Streaming transcriber with whisper☆685Updated last year
- OpenVINO version of openai/whisper☆157Updated 10 months ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆234Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆262Updated 7 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines☆276Updated 3 weeks ago
- A python package to build AI-powered real-time audio applications☆992Updated 2 months ago
- Build real-time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆774Updated 4 months ago
- Speech-to-text server framework with next-gen Kaldi☆524Updated this week
- Efficient Inference of Transformer models☆353Updated last month
- ☆431Updated 2 months ago
- Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.☆237Updated 7 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆421Updated 10 months ago
- Pybind11 bindings for Whisper.cpp☆321Updated this week
- Experiments to test different speech recognition systems for SEPIA Framework☆57Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆1,770Updated 2 weeks ago
- Suno AI's Bark model in C/C++ for fast text-to-speech☆684Updated 2 months ago
- Live speech recognition using Facebook's wav2vec 2.0 model.☆315Updated 7 months ago
- ☆443Updated last year
- A quick experiment to achieve almost realtime transcription using Whisper.☆185Updated last year
- Implementation of Meta-Voicebox: The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆551Updated last year
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone☆878Updated last year
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆3,969Updated last week
- A nearly-live implementation of OpenAI's Whisper.☆1,798Updated 2 weeks ago
- Voice activity detector (VAD) for the browser with a simple API☆773Updated last month
- ☆342Updated 6 months ago
- A voice to text keyboard based on OpenAI Whisper Model.☆45Updated last year