fcakyon / pywhisper
openai/whisper + extra features
☆89 · Updated 2 years ago
Alternatives and similar repositories for pywhisper
Users that are interested in pywhisper are comparing it to the libraries listed below
- Code for OpenAI Whisper Web App Demo ☆93 · Updated 2 years ago
- ☆158 · Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 · Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment ☆246 · Updated 2 years ago
- TTS with The Massively Multilingual Speech (MMS) project ☆233 · Updated last year
- Zero-shot Audio Classification using Whisper ☆78 · Updated 2 years ago
- openvino version of openai/whisper ☆168 · Updated last year
- Whisper combined with Silero VAD, for improved long-form transcriptions ☆52 · Updated 2 years ago
- A curated list of awesome OpenAI's Whisper ☆101 · Updated last year
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023 ☆54 · Updated 2 years ago
- Coqui AI TTS plugin ☆80 · Updated last week
- A huggingface pipeline to train a gpt model based on the transcript obtained by the OpenAI Whisper model ☆15 · Updated 2 years ago
- Powered by OpenAI Whisper & Gradio ☆30 · Updated 2 years ago
- A testing repo to share code and thoughts on diarisation ☆55 · Updated last year
- ☆64 · Updated 2 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ☆36 · Updated 2 years ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆46 · Updated last year
- Misc. tools/scripts that I made to use for tortoise ☆21 · Updated 10 months ago
- Accelerate Whisper tasks such as transcription, by multiprocessing through parallelization ☆25 · Updated 2 years ago
- whisper.cpp bindings for python ☆98 · Updated last year
- Performant and accurate speech recognition built on Pytorch ☆253 · Updated 3 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆116 · Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines