nari-labs / dia2Links
TTS model capable of streaming conversational audio in realtime.
☆920Updated 3 weeks ago
Alternatives and similar repositories for dia2
Users that are interested in dia2 are comparing it to the libraries listed below
Sorting:
- ☆370Updated last month
- Lightning-Fast, On-Device TTS — running natively via ONNX.☆1,844Updated this week
- Open-source framework for developing real-time multimodal conversational AI agents.☆548Updated this week
- An open-source implementation of Whisper☆469Updated last month
- Optimized Whisper models for streaming and on-device use☆765Updated this week
- Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support…☆723Updated 2 months ago
- Make text LLMs listen and speak☆1,028Updated last week
- CommonForms — open models to auto-detect PDF form fields☆923Updated 3 weeks ago
- VLLM Port of the Chatterbox TTS model☆351Updated 2 months ago
- ☆634Updated last month
- Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiob…☆217Updated 4 months ago
- On-device TTS model by Neuphonic☆4,273Updated this week
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆298Updated 6 months ago
- Build AI applications that can see, hear, and speak using your screens, microphones, and cameras as inputs.☆1,062Updated this week
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆346Updated 8 months ago
- ☆532Updated 2 months ago
- Enable AI models for video production in the browser☆487Updated last month
- ComfyDeployed☆435Updated 3 months ago
- Maivi - My AI Voice Input: Real-time voice-to-text local on cpu better than whisper with hotkey support☆256Updated 2 months ago
- ☆424Updated 2 weeks ago
- Laddr is a python framework for building multi-agent systems where agents communicate, delegate tasks, and execute work in parallel. Thin…☆272Updated 3 weeks ago
- Build an AI Telephony Agent for Inbound and Outbound Calls☆225Updated 2 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆298Updated 2 months ago
- Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages☆2,467Updated this week
- Open Source Locally Hosted Lovable with Full Stack Support☆276Updated last week
- PageLM is a community driven version of NotebookLM & a education platform that transforms study materials into interactive resources like…☆840Updated 2 weeks ago
- ☆70Updated 4 months ago
- mem-agent mcp server☆588Updated last month
- BUDDIE is the first full-stack open-source AI voice interaction solution, providing a complete end-to-end system from hardware design to …☆228Updated 4 months ago
- Open Source AI Platform - AI Chat with advanced features that works with every LLM☆219Updated last week