Open-Source Frontier Voice AI
☆37,401Apr 6, 2026Updated this week
Alternatives and similar repositories for VibeVoice
Users that are interested in VibeVoice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SoTA open-source TTS☆24,197Mar 26, 2026Updated 2 weeks ago
- SOTA Open Source TTS☆29,048Mar 30, 2026Updated last week
- Python tool for converting files and office documents to Markdown.☆93,259Mar 30, 2026Updated last week
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆19,247Nov 19, 2025Updated 4 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆20,400Mar 16, 2026Updated 3 weeks ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆14,312Apr 4, 2026Updated last week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆86,467Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆63,500Updated this week
- An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System☆19,923Mar 16, 2026Updated 3 weeks ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆44,993Aug 16, 2024Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆36,188Apr 19, 2025Updated 11 months ago
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning☆6,272Apr 3, 2026Updated last week
- Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.☆27,690Updated this week
- Towards Human-Sounding Speech☆6,068Dec 5, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 🔥 The Web Data API for AI - Power AI agents with clean web data☆104,217Apr 4, 2026Updated last week
- Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.☆104,938Apr 1, 2026Updated last week
- real time face swap and one-click video deepfake with only a single image☆89,188Updated this week
- Unsloth Studio is a web UI for training and running open models like Qwen3.5, Gemma 4, DeepSeek, gpt-oss locally.☆59,774Updated this week
- Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ in…☆183,145Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆130,242Apr 3, 2026Updated last week
- A simple screen parsing tool towards pure vision based GUI agent☆24,619Sep 12, 2025Updated 6 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆97,479Mar 27, 2026Updated 2 weeks ago
- Production-ready platform for agentic workflow development.☆135,703Apr 3, 2026Updated last week
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone☆24,322Apr 1, 2026Updated last week
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆9,737Mar 25, 2026Updated 2 weeks ago
- Simultaneous speech-to-text models☆10,036Mar 31, 2026Updated last week
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆108,065Updated this week
- Wan: Open and Advanced Large-Scale Video Generative Models☆15,042Mar 17, 2026Updated 3 weeks ago
- 🔊 Text-Prompted Generative Audio Model☆39,076Aug 19, 2024Updated last year
- Official inference framework for 1-bit LLMs☆38,049Mar 10, 2026Updated last month
- The conversational control layer for customer-facing AI agents - Parlant is a context-engineering framework optimized for controlling cus…☆17,877Updated this week
- Spark-TTS Inference Code☆10,957Apr 9, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆168,287Updated this week
- Lightweight coding agent that runs in your terminal☆73,775Updated this week
- Universal memory layer for AI Agents☆52,137Updated this week
- 🙌 OpenHands: AI-Driven Development☆70,666Updated this week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆9,962Mar 4, 2026Updated last month
- The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harne…☆74,749Updated this week
- Build, run, manage agentic software at scale.☆39,153Apr 3, 2026Updated last week