Instant voice cloning by MIT and MyShell. Audio foundation model.
β36,482Apr 19, 2025Updated last year
Alternatives and similar repositories for OpenVoice
Users that are interested in OpenVoice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ45,255Aug 16, 2024Updated last year
- SOTA Open Source TTSβ30,158May 4, 2026Updated last week
- π Text-Prompted Generative Audio Modelβ39,105Aug 19, 2024Updated last year
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.β7,406Dec 24, 2024Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)β57,341Apr 30, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Zero-Shot Speech Editing and Text-to-Speech in the Wildβ8,482Mar 15, 2025Updated last year
- A generative speech model for daily dialogue.β39,237Apr 10, 2026Updated last month
- Amphion (/Γ¦mΛfaΙͺΙn/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junioβ¦β9,799Mar 25, 2026Updated last month
- Inference and training library for high-quality TTS models.β5,577Dec 10, 2024Updated last year
- The ultimate space for work and life β to find, build, and collaborate with agent teammates that grow with you. We are taking agent harneβ¦β76,774Updated this week
- A natural language interface for computersβ63,389May 4, 2026Updated last week
- Robust Speech Recognition via Large-Scale Weak Supervisionβ99,039Apr 15, 2026Updated 3 weeks ago
- Open-Sora: Democratizing Efficient Video Production for Allβ28,947Apr 9, 2026Updated last month
- Industry leading face manipulation platformβ28,145May 5, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.β112,559Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.β42,388May 6, 2026Updated last week
- π OpenHands: AI-Driven Developmentβ73,120Updated this week
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.β170,820May 6, 2026Updated last week
- Clone a voice in 5 seconds to generate arbitrary speech in real-timeβ59,732Mar 9, 2026Updated 2 months ago
- real time face swap and one-click video deepfake with only a single imageβ92,922Updated this week
- Foundational model for human-like, expressive TTSβ4,197Jul 30, 2024Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"β14,442Apr 20, 2026Updated 3 weeks ago
- Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)β72,561Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,247Aug 10, 2024Updated last year
- OpenUI let's you describe UI using your imagination, then see it rendered live.β22,286Feb 11, 2026Updated 3 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β23,263Mar 3, 2026Updated 2 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.β20,966May 3, 2026Updated last week
- Build, run, and manage agent platforms.β40,013Updated this week
- EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engineβ8,478Aug 13, 2024Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ19,707Apr 24, 2026Updated 2 weeks ago
- Production-ready platform for agentic workflow development.β140,588Updated this week
- The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.β59,768Updated this week
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- We write your reusable computer vision tools. πβ38,429May 6, 2026Updated last week
- The first real AI developerβ33,778Apr 17, 2026Updated 3 weeks ago
- Opiniated RAG for integrating GenAI in your apps π§ Focus on your product rather than the RAG. Easy integration in existing products wiβ¦β39,138Jul 9, 2025Updated 10 months ago
- Universal memory layer for AI Agentsβ55,385Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)β136,384Updated this week
- Foundational Models for State-of-the-Art Speech and Text Translationβ11,776Apr 8, 2026Updated last month
- PhotoMaker [CVPR 2024]β10,108Oct 31, 2024Updated last year