mesolitica / vllm-whisper
A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper
☆17Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for vllm-whisper
- ☆54Updated this week
- ☆59Updated last year
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆157Updated 8 months ago
- Putting flows on top of neural transducers for better TTS☆63Updated 3 weeks ago
- ONNX Inference of Pyannote Segmentation☆66Updated 2 months ago
- Audio tokenization, in the fastest way possible!☆45Updated 2 months ago
- Collection of Open Source Speech Data☆146Updated 2 weeks ago
- Tunable pipelines☆30Updated last month
- Joint speech-language model - respond directly to audio!☆30Updated 6 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆83Updated last month
- Zero-shot Audio Classification using Whisper☆74Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆33Updated last year
- one script for xls-r/xlsr/whisper fine-tuning☆39Updated last year
- Fine-Tune Whisper with Transformers and PEFT☆38Updated last year
- audiolm-pytorch training code☆15Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆44Updated 4 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆65Updated last year
- Official Code for ParrotTTS☆42Updated last month
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 9 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆44Updated 3 months ago
- VALL-E 2 reproduction☆87Updated 4 months ago
- ☆257Updated 5 months ago
- Repository contains code to fine-tune WhisperASR model☆23Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆84Updated last month
- ☆16Updated 3 years ago
- Triton backend for https://github.com/OpenNMT/CTranslate2☆32Updated last year
- Whisper fine-tuning event script to use multiple hf datasets☆32Updated last year
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆232Updated 6 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆12Updated this week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆45Updated 2 weeks ago