mesolitica / vllm-whisperLinks
A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper
☆31Updated last year
Alternatives and similar repositories for vllm-whisper
Users that are interested in vllm-whisper are comparing it to the libraries listed below
Sorting:
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆103Updated 11 months ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆126Updated 3 months ago
- ONNX and TensorRT implementation of Whisper☆64Updated 2 years ago
- openvino version of openai/whisper☆175Updated last year
- ONNX Inference of Pyannote Segmentation☆93Updated 8 months ago
- ☆126Updated 3 weeks ago
- Open TTS models, built for streaming on the edge☆42Updated 5 months ago
- Official implementation of the TTS model Lina-Speech☆168Updated 8 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆123Updated last month
- A curated list of awesome voice activity detection☆62Updated 9 months ago
- Open-source reproducible benchmarks from Argmax☆57Updated last week
- Whisper fine-tuning event script to use multiple hf datasets☆32Updated 2 years ago
- Putting flows on top of neural transducers for better TTS☆63Updated last month
- An unofficial PyTorch implementation of VALL-E☆88Updated last month
- Audio tokenization, in the fastest way possible!☆52Updated last year
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆176Updated last year
- ☆40Updated this week
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated last year
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- whisper.cpp bindings for python☆102Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆118Updated 2 years ago
- Very fast, accurate speaker diarization☆87Updated this week
- ☆19Updated 6 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆68Updated 2 months ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Collection of Open Source Speech Data☆160Updated 10 months ago
- ONNX implementation of Whisper. PyTorch free.☆99Updated 9 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated this week