Instant voice cloning by MIT and MyShell. Audio foundation model.
☆35,999 · Apr 19, 2025 · Updated 10 months ago
Alternatives and similar repositories for OpenVoice
Users interested in OpenVoice are comparing it to the libraries listed below.
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆44,608 · Aug 16, 2024 · Updated last year
- SOTA Open Source TTS ☆24,983 · Feb 2, 2026 · Updated 3 weeks ago
- Text-Prompted Generative Audio Model ☆39,006 · Aug 19, 2024 · Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆55,240 · Feb 9, 2026 · Updated 2 weeks ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. ☆7,217 · Dec 24, 2024 · Updated last year
- Zero-Shot Speech Editing and Text-to-Speech in the Wild ☆8,463 · Mar 15, 2025 · Updated 11 months ago
- A generative speech model for daily dialogue. ☆38,766 · Jan 18, 2026 · Updated last month
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio… ☆9,696 · May 27, 2025 · Updated 9 months ago
- The ultimate space for work and life - to find, build, and collaborate with agent teammates that grow with you. We are taking agent harne… ☆72,564 · Updated this week
- A natural language interface for computers ☆62,427 · Feb 9, 2026 · Updated 2 weeks ago
- Robust Speech Recognition via Large-Scale Weak Supervision ☆95,206 · Dec 15, 2025 · Updated 2 months ago
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. ☆104,246 · Updated this week
- Industry leading face manipulation platform ☆26,919 · Updated this week
- Inference and training library for high-quality TTS models. ☆5,534 · Dec 10, 2024 · Updated last year
- Open-Sora: Democratizing Efficient Video Production for All ☆28,604 · Apr 30, 2025 · Updated 10 months ago
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. ☆40,672 · Updated this week
- OpenHands: AI-Driven Development ☆68,154 · Updated this week
- real time face swap and one-click video deepfake with only a single image ☆79,673 · Updated this week
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. ☆163,632 · Updated this week
- Clone a voice in 5 seconds to generate arbitrary speech in real-time ☆59,373 · Dec 15, 2025 · Updated 2 months ago
- Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue) ☆71,751 · Updated this week
- The programming language for agentic software. Build, run, and manage multi-agent systems at scale. ☆38,104 · Updated this week
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆23,013 · Mar 13, 2025 · Updated 11 months ago
- Universal memory layer for AI Agents ☆47,994 · Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more. ☆54,878 · Feb 21, 2026 · Updated last week
- Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products wi… ☆38,948 · Jul 9, 2025 · Updated 7 months ago
- We write your reusable computer vision tools. ☆36,543 · Updated this week
- OpenUI lets you describe UI using your imagination, then see it rendered live. ☆22,056 · Feb 11, 2026 · Updated 2 weeks ago
- Production-ready platform for agentic workflow development. ☆130,029 · Updated this week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆14,122 · Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,360 · Updated this week
- The first real AI developer ☆33,798 · Nov 10, 2025 · Updated 3 months ago
- Foundational model for human-like, expressive TTS ☆4,199 · Jul 30, 2024 · Updated last year
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆27,918 · Sep 30, 2025 · Updated 5 months ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆124,763 · Updated this week
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. ☆19,695 · Feb 11, 2026 · Updated 2 weeks ago
- EmotiVoice: a Multi-Voice and Prompt-Controlled TTS Engine ☆8,444 · Aug 13, 2024 · Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆6,172 · Aug 10, 2024 · Updated last year
- one-click face swap ☆30,541 · Aug 19, 2024 · Updated last year