nari-labs / dia
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆15,558Updated this week
Alternatives and similar repositories for dia
Users that are interested in dia are comparing it to the libraries listed below
Sorting:
- Towards Human-Sounding Speech☆4,750Updated last week
- Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories. Join the discord: https://discord.gg/gMwThUMeme☆5,227Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training☆12,381Updated this week
- Suna - Open Source Generalist AI Agent☆10,978Updated this week
- Have a natural, spoken conversation with AI!☆2,139Updated last week
- Roo Code (prev. Roo Cline) gives you a whole dev team of AI agents in your code editor.☆13,666Updated this week
- Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors☆7,579Updated this week
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆2,124Updated this week
- A Conversational Speech Generation Model☆13,220Updated last month
- Collection of leaked system prompts☆7,586Updated last week
- https://hf.co/hexgrad/Kokoro-82M☆2,777Updated 2 weeks ago
- Pocket Flow: 100-line LLM framework. Let Agents build Agents!☆4,546Updated last week
- Run AI Agent in your browser.☆12,919Updated last week
- Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…☆6,577Updated 2 months ago
- MAGI-1: Autoregressive Video Generation at Scale☆3,001Updated this week
- ☆5,266Updated last week
- 🚀 The fast, Pythonic way to build MCP servers and clients☆9,835Updated this week
- Open Source framework for voice and multimodal conversational AI☆6,065Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆21,830Updated 2 weeks ago
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wi…☆4,334Updated this week
- An open protocol enabling communication and interoperability between opaque agentic applications.☆15,464Updated this week
- A powerful framework for building realtime voice AI agents 🤖🎙️📹☆5,931Updated this week
- The Self-hosted AI Starter Kit is an open-source template that quickly sets up a local AI environment. Curated by n8n, it provides essent…☆8,623Updated 3 weeks ago
- Fully local web research and report writing assistant☆7,348Updated last month
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆22,551Updated this week
- The python library for real-time communication☆3,891Updated this week
- 📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your c…☆15,905Updated this week
- Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation☆3,500Updated last week
- Official repository for LTX-Video☆5,436Updated this week
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching☆2,746Updated last week