allenai / OLMoASRLinks
An open-source implementation of Whisper
☆477Updated 3 months ago
Alternatives and similar repositories for OLMoASR
Users that are interested in OLMoASR are comparing it to the libraries listed below
Sorting:
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆307Updated 8 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆388Updated 2 weeks ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆295Updated 8 months ago
- A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.☆694Updated last week
- ☆261Updated 8 months ago
- ☆245Updated last month
- VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency☆185Updated 3 months ago
- Soprano-Factory: Train your own 2000x realtime text-to-speech model☆203Updated 3 weeks ago
- WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov…☆604Updated 3 weeks ago
- A highly compressive and high-quality neural audio codec for speech models.☆250Updated 2 weeks ago
- ☆580Updated 3 weeks ago
- DACVAE☆191Updated last month
- A high quality and fast TTS repository☆498Updated last month
- ☆442Updated 2 months ago
- ☆511Updated last week
- VLLM Port of the Chatterbox TTS model☆365Updated 3 months ago
- Kyutai with an "eye"☆236Updated 10 months ago
- TTS model capable of streaming conversational audio in realtime.☆1,051Updated 2 months ago
- ☆386Updated 3 months ago
- Fast audio super resolution from 16khz to 48khz.☆192Updated last month
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆348Updated 9 months ago
- AudioStory: Generating Long-Form Narrative Audio with Large Language Models☆301Updated 4 months ago
- ☆370Updated 4 months ago
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B☆569Updated 2 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆279Updated 6 months ago
- Train your own speech AI model from scratch☆146Updated this week
- Streaming and Fine-tuning for Chatterbox TTS☆267Updated 7 months ago
- Optimized Whisper models for streaming and on-device use☆816Updated this week
- ☆637Updated 3 months ago
- ~950 line, minimal, extensible LLM inference engine built from scratch.☆406Updated last month