mesolitica / vllm-whisper
A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper
☆32 Jul 28, 2024 Updated last year
Alternatives and similar repositories for vllm-whisper
Users that are interested in vllm-whisper are comparing it to the libraries listed below
- ☆14 Nov 26, 2024 Updated last year
- Using OpenVINO to speed up MeloTTS inference ☆15 Nov 1, 2024 Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline ☆13 Feb 14, 2024 Updated 2 years ago
- cpp inference for EmotiVoice ☆16 Jan 1, 2024 Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning ☆18 Oct 20, 2024 Updated last year
- proof of concept conversation orchestrator with a speech-language model ☆20 Oct 19, 2024 Updated last year
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast… ☆52 Updated this week
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations ☆21 Sep 21, 2025 Updated 4 months ago
- Open-source reproducible benchmarks from Argmax ☆77 Jan 19, 2026 Updated 3 weeks ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆30 May 27, 2023 Updated 2 years ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis ☆36 Dec 24, 2025 Updated last month
- ☆18 Sep 19, 2023 Updated 2 years ago
- RVC Onnx Infer- Upgraded and simplified-ish ☆25 May 9, 2024 Updated last year
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202… ☆27 May 17, 2023 Updated 2 years ago
- ☆19 Mar 22, 2024 Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 Aug 5, 2024 Updated last year
- ☆22 Jun 24, 2024 Updated last year
- Supervoice diffusion enhance ☆28 Jul 15, 2024 Updated last year
- ☆30 Jul 18, 2024 Updated last year
- Just another FastSpeech 2 but cleaner code :) ☆29 Jun 28, 2024 Updated last year
- ☆82 Jan 28, 2026 Updated 2 weeks ago
- ☆25 Mar 6, 2024 Updated last year
- ☆62 Jul 25, 2024 Updated last year
- MSP-Podcast Challenge Baseline Code ☆30 Jun 12, 2024 Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition ☆26 Sep 23, 2020 Updated 5 years ago
- Lightweight speaker anonymization [IEEE SLT2021] ☆27 Jun 6, 2022 Updated 3 years ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023) ☆27 Oct 10, 2023 Updated 2 years ago
- Identify speakers with stable voice timbre. ☆32 Jun 20, 2024 Updated last year
- ☆70 Sep 13, 2024 Updated last year
- [ICASSP 2024] Official code for FreGrad ☆35 May 13, 2024 Updated last year
- ☆33 Jun 29, 2023 Updated 2 years ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model. ☆31 May 22, 2025 Updated 8 months ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint ☆76 Feb 9, 2026 Updated last week
- ☆30 Jun 12, 2025 Updated 8 months ago
- Cantonese Text to Speech with VITS implementation ☆37 Apr 8, 2023 Updated 2 years ago
- ☆39 Dec 19, 2024 Updated last year
- Joint speech-language model - respond directly to audio! ☆30 May 13, 2024 Updated last year
- Implementation of TTS model based on NVIDIA P-Flow TTS Paper ☆77 May 12, 2024 Updated last year
- Speech-to-text transcription VST3/ARA plugin ☆53 Feb 2, 2026 Updated 2 weeks ago