carloscdias / whisper-cpp-python
whisper.cpp bindings for python
☆76 · Updated last year
Related projects
Alternatives and complementary repositories for whisper-cpp-python
- Python bindings for whisper.cpp ☆169 · Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆83 · Updated 6 months ago
- openvino version of openai/whisper ☆161 · Updated last year
- A testing repo to share code and thoughts on diarisation ☆51 · Updated 7 months ago
- streaming speech to text server using Whisper ☆83 · Updated last year
- A ggml (C++) re-implementation of tortoise-tts ☆155 · Updated 2 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆101 · Updated 9 months ago
- ☆152 · Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆105 · Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆196 · Updated last week
- Create an LJSpeech structured voice dataset on wave input ☆19 · Updated last month
- ☆256 · Updated 4 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆134 · Updated 3 months ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment ☆237 · Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆133 · Updated last year
- Python bindings for whisper.cpp ☆216 · Updated 5 months ago
- Pybind11 bindings for Whisper.cpp ☆324 · Updated this week
- Speaker Diarization with Transformers ☆59 · Updated 5 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆42 · Updated this week
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆45 · Updated last year
- Zero-shot Audio Classification using Whisper ☆74 · Updated last year
- ☆86 · Updated 6 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptions ☆44 · Updated last year
- On-device streaming text-to-speech engine powered by deep learning ☆52 · Updated this week
- ☆347 · Updated 7 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆66 · Updated 6 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens ☆441 · Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆152 · Updated last month
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine ☆307 · Updated 2 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆81 · Updated last month