Instant voice cloning by MIT and MyShell. Audio foundation model.
☆36,049 · Apr 19, 2025 · Updated 10 months ago
Alternatives and similar repositories for OpenVoice
Users who are interested in OpenVoice are comparing it to the repositories listed below.
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆44,763 · Aug 16, 2024 · Updated last year
- SOTA Open Source TTS ☆25,154 · Mar 5, 2026 · Updated last week
- Text-Prompted Generative Audio Model ☆39,043 · Aug 19, 2024 · Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆55,605 · Feb 9, 2026 · Updated last month
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. ☆7,242 · Dec 24, 2024 · Updated last year
- Zero-Shot Speech Editing and Text-to-Speech in the Wild ☆8,466 · Mar 15, 2025 · Updated 11 months ago
- A generative speech model for daily dialogue. ☆38,905 · Jan 18, 2026 · Updated last month
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio… ☆9,709 · May 27, 2025 · Updated 9 months ago
- The ultimate space for work and life - to find, build, and collaborate with agent teammates that grow with you. We are taking agent harne… ☆73,318 · Updated this week
- A natural language interface for computers ☆62,652 · Feb 9, 2026 · Updated last month
- Robust Speech Recognition via Large-Scale Weak Supervision ☆95,527 · Dec 15, 2025 · Updated 2 months ago
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. ☆105,651 · Updated this week
- Industry leading face manipulation platform ☆26,995 · Mar 5, 2026 · Updated last week
- Inference and training library for high-quality TTS models. ☆5,547 · Dec 10, 2024 · Updated last year
- OpenHands: AI-Driven Development ☆68,865 · Updated this week
- Open-Sora: Democratizing Efficient Video Production for All ☆28,658 · Apr 30, 2025 · Updated 10 months ago
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. ☆40,987 · Updated this week
- real time face swap and one-click video deepfake with only a single image ☆79,950 · Mar 6, 2026 · Updated last week
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. ☆164,248 · Mar 6, 2026 · Updated last week
- Clone a voice in 5 seconds to generate arbitrary speech in real-time ☆59,512 · Updated this week
- Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue) ☆71,877 · Updated this week
- Build, run, manage agentic software at scale. ☆38,516 · Updated this week
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆23,048 · Mar 3, 2026 · Updated last week
- Universal memory layer for AI Agents ☆49,365 · Updated this week
- The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration. ☆55,868 · Updated this week
- Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products wi… ☆38,974 · Jul 9, 2025 · Updated 8 months ago
- We write your reusable computer vision tools. ☆36,654 · Mar 3, 2026 · Updated last week
- OpenUI lets you describe UI using your imagination, then see it rendered live. ☆22,081 · Feb 11, 2026 · Updated last month
- Production-ready platform for agentic workflow development. ☆131,572 · Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,431 · Mar 1, 2026 · Updated last week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆14,169 · Mar 4, 2026 · Updated last week
- The first real AI developer ☆33,810 · Nov 10, 2025 · Updated 4 months ago
- Foundational model for human-like, expressive TTS ☆4,200 · Jul 30, 2024 · Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. ☆19,913 · Feb 11, 2026 · Updated last month
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆27,985 · Sep 30, 2025 · Updated 5 months ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆126,337 · Updated this week
- EmotiVoice: a Multi-Voice and Prompt-Controlled TTS Engine ☆8,454 · Aug 13, 2024 · Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆6,196 · Aug 10, 2024 · Updated last year
- one-click face swap ☆30,547 · Aug 19, 2024 · Updated last year