nari-labs / dia
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆7,495Updated this week
Alternatives and similar repositories for dia:
Users that are interested in dia are comparing it to the libraries listed below
- Towards Human-Sounding Speech☆4,490Updated last week
- A Conversational Speech Generation Model☆12,768Updated 3 weeks ago
- An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl☆5,435Updated 2 months ago
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆19,660Updated last month
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆11,448Updated last week
- Suna - Open Source Generalist AI Agent☆1,517Updated this week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆8,128Updated this week
- Open Source framework for voice and multimodal conversational AI☆5,709Updated this week
- YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open☆4,842Updated 2 weeks ago
- The python library for real-time communication☆3,750Updated this week
- https://hf.co/hexgrad/Kokoro-82M☆2,432Updated 2 weeks ago
- ☆10,430Updated last week
- An AI web browsing framework focused on simplicity and extensibility.☆11,127Updated this week
- Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)☆4,041Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training☆11,889Updated this week
- A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API☆8,706Updated this week
- Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…☆6,452Updated last month
- The official Python SDK for Model Context Protocol servers and clients☆10,629Updated this week
- Fully local web research and report writing assistant☆7,099Updated last month
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching☆2,457Updated this week
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI O…☆5,252Updated this week
- A fast multimodal LLM for real-time voice☆3,855Updated 2 months ago
- ☆4,821Updated this week
- Fast and accurate automatic speech recognition (ASR) for edge devices☆2,687Updated 2 months ago
- A lightweight, powerful framework for multi-agent workflows☆9,339Updated this week
- TTS with kokoro and onnx runtime☆1,917Updated 2 weeks ago
- ☆4,942Updated 2 weeks ago
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wi…☆4,239Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆37,004Updated this week
- Run AI Agent in your browser.☆12,341Updated 2 weeks ago