nari-labs / dia2Links
TTS model capable of streaming conversational audio in realtime.
☆1,027Updated 2 months ago
Alternatives and similar repositories for dia2
Users that are interested in dia2 are comparing it to the libraries listed below
Sorting:
- A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.☆336Updated this week
- Optimized Whisper models for streaming and on-device use☆811Updated this week
- ☆385Updated 2 months ago
- A TTS that fits in your CPU (and pocket)☆2,683Updated this week
- Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.☆2,521Updated last week
- Soprano: Instant, Ultra-Realistic Text-to-Speech☆1,100Updated 2 weeks ago
- A high quality and fast TTS repository☆486Updated last month
- Controllable and fast Text-to-Speech for over 7000 languages!☆322Updated 7 months ago
- ☆502Updated this week
- Open-source framework for developing real-time multimodal conversational AI agents.☆587Updated this week
- Clean, polished interface for Tencent’s SongGeneration. Create songs from text prompts or reference audio, with batch processing and smar…☆340Updated this week
- VLLM Port of the Chatterbox TTS model☆364Updated 3 months ago
- Nanobanana fal AI powered Photoshop-esque Studio☆333Updated 2 months ago
- Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiob…☆233Updated 5 months ago
- An open-source implementation of Whisper☆475Updated 3 months ago
- 🚀 The Fastest Chunker in the West 🇺🇸 Upto 1TB/s "semantic" chunking, quick and easy!☆252Updated last week
- Make text LLMs listen and speak☆1,133Updated last week
- Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support…☆747Updated 3 months ago
- ☆637Updated 2 months ago
- CommonForms — open models to auto-detect PDF form fields☆957Updated 2 months ago
- On-device TTS model by Neuphonic☆4,718Updated 2 weeks ago
- Enable AI models for video production in the browser☆580Updated 2 months ago
- Build AI applications that can see, hear, and speak using your screens, microphones, and cameras as inputs.☆1,077Updated last month
- Extract any sound with text prompts. Memory-optimized SAM-Audio with modern UI.☆269Updated last month
- Open Source Locally Hosted Lovable with Full Stack Support☆352Updated last month
- ☆439Updated last month
- ComfyDeployed☆439Updated 4 months ago
- Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK☆1,009Updated this week
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆382Updated last week
- A desktop app for running Large Language Models locally.☆419Updated last week