nari-labs / dia2Links
TTS model capable of streaming conversational audio in realtime.
☆1,007Updated last month
Alternatives and similar repositories for dia2
Users that are interested in dia2 are comparing it to the libraries listed below
Sorting:
- ☆382Updated 2 months ago
- Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.☆2,015Updated this week
- Optimized Whisper models for streaming and on-device use☆771Updated this week
- Soprano: Instant, Ultra-Realistic Text-to-Speech☆677Updated last week
- ☆253Updated this week
- Open-source framework for developing real-time multimodal conversational AI agents.☆553Updated this week
- A high quality and fast TTS repository☆442Updated 2 weeks ago
- Make text LLMs listen and speak☆1,058Updated 2 weeks ago
- Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support…☆738Updated 2 months ago
- VLLM Port of the Chatterbox TTS model☆359Updated 2 months ago
- Build AI applications that can see, hear, and speak using your screens, microphones, and cameras as inputs.☆1,067Updated 3 weeks ago
- ☆635Updated 2 months ago
- Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiob…☆225Updated 5 months ago
- An open-source implementation of Whisper☆470Updated 2 months ago
- CommonForms — open models to auto-detect PDF form fields☆933Updated last month
- Controllable and fast Text-to-Speech for over 7000 languages!☆306Updated 6 months ago
- Enable AI models for video production in the browser☆494Updated 2 months ago
- Nanobanana fal AI powered Photoshop-esque Studio☆319Updated last month
- On-device TTS model by Neuphonic☆4,328Updated 2 weeks ago
- Clean, polished interface for Tencent’s SongGeneration. Create songs from text prompts or reference audio, with batch processing and smar…☆222Updated last week
- This project is a collection of Docker-based web user interfaces designed to easily run various state-of-the-art generative AI models loc…☆367Updated last week
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆347Updated 9 months ago
- Privacy focused AI powered meeting notes using locally hosted Small Language Models☆262Updated this week
- ☆430Updated last month
- ☆533Updated 3 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆306Updated 7 months ago
- Extract any sound with text prompts. Memory-optimized SAM-Audio with modern UI.☆247Updated last week
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆331Updated this week
- Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages☆2,556Updated last week
- Open Source AI Platform - AI Chat with advanced features that works with every LLM☆227Updated this week