Vaibhavs10 / translate-with-whisper
☆152 · Updated last year
Related projects
Alternatives and complementary repositories for translate-with-whisper
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆84 · Updated 6 months ago
- ☆256 · Updated 5 months ago
- whisper.cpp bindings for python ☆77 · Updated last year
- Video+code lecture on building nanoGPT from scratch ☆64 · Updated 5 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆84 · Updated last month
- ☆53 · Updated 3 weeks ago
- ☆61 · Updated 3 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆196 · Updated 3 weeks ago
- ☆253 · Updated 8 months ago
- Speaker Diarization with Transformers ☆59 · Updated 6 months ago
- ☆87 · Updated 6 months ago
- ☆191 · Updated 5 months ago
- ASR + diarization model server with speculative decoding ☆50 · Updated 5 months ago
- Collection of Open Source Speech Data ☆146 · Updated last week
- Transcription with speaker diarization pipeline ☆86 · Updated last year
- Enhancing Translation with RAG-Powered Large Language Models ☆65 · Updated last month
- Efficient approach to speaker diarization using voice characteristics extraction ☆68 · Updated 6 months ago
- Joint speech-language model - respond directly to audio! ☆30 · Updated 6 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆103 · Updated 9 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆133 · Updated last year
- Scripts to create your own moe models using mlx ☆86 · Updated 8 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s… ☆50 · Updated 6 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆45 · Updated 2 weeks ago
- ☆307 · Updated 2 months ago
- ☆347 · Updated 8 months ago
- A ggml (C++) re-implementation of tortoise-tts ☆159 · Updated 3 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆124 · Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆138 · Updated 4 months ago
- Finetune VITS and MMS using HuggingFace's tools ☆122 · Updated 7 months ago
- ☆253 · Updated 5 months ago