fcakyon / pywhisper
openai/whisper + extra features
☆89 Updated 2 years ago
Alternatives and similar repositories for pywhisper:
Users who are interested in pywhisper are comparing it to the libraries listed below:
- Code for OpenAI Whisper Web App Demo ☆93 Updated 2 years ago
- A curated list of awesome OpenAI's Whisper ☆101 Updated last year
- ☆156 Updated last year
- Powered by OpenAI Whisper & Gradio ☆30 Updated 2 years ago
- Whisper combined with Silero VAD, for improved long-form transcriptions ☆49 Updated 2 years ago
- A huggingface pipeline to train a gpt model based on the transcript obtained by the OpenAI Whisper model ☆15 Updated 2 years ago
- TTS with The Massively Multilingual Speech (MMS) project ☆230 Updated 9 months ago
- Zero-shot Audio Classification using Whisper ☆80 Updated 2 years ago
- openvino version of openai/whisper ☆166 Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment ☆243 Updated 2 years ago
- ☆83 Updated 10 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆136 Updated last year
- generate granular word-level captions in srt format ☆57 Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 Updated 11 months ago
- Text prompt steered synthetic audio generators ☆46 Updated 2 weeks ago
- GradioUI for TortoiseTTS voice generation ☆34 Updated last year
- A quick experiment to achieve almost realtime transcription using Whisper. ☆187 Updated 2 years ago
- ☆36 Updated 2 years ago
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models… ☆18 Updated 2 years ago
- OpenAI Whisper + davinci for podcast summarization ☆71 Updated last year
- Coqui AI TTS plugin ☆74 Updated last month
- ☆147 Updated last year
- Google Colab-backed Web UI for creating music with OpenAI Jukebox ☆84 Updated last year
- Production-ready audio and video transcription app that can run on your laptop or in the cloud. ☆72 Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆46 Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆112 Updated last year
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices ☆20 Updated last year
- llama-4bit-colab ☆65 Updated 2 years ago
- Faster Tortoise inference than Tortoise Fast Fork ☆128 Updated last year
- A Gradio setup for Tortoise TTS. ☆45 Updated last year