Instant voice cloning by MIT and MyShell. Audio foundation model.
β36,755Apr 19, 2025Updated last year
Alternatives and similar repositories for OpenVoice
Users that are interested in OpenVoice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ45,567Aug 16, 2024Updated last year
- SOTA Open Source TTSβ30,879Jun 9, 2026Updated last week
- π Text-Prompted Generative Audio Modelβ39,161Aug 19, 2024Updated last year
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.β7,502Dec 24, 2024Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)β58,698Apr 30, 2026Updated last month
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Zero-Shot Speech Editing and Text-to-Speech in the Wildβ8,496May 30, 2026Updated 3 weeks ago
- A generative speech model for daily dialogue.β39,469Apr 10, 2026Updated 2 months ago
- Amphion (/Γ¦mΛfaΙͺΙn/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junioβ¦β9,842Mar 25, 2026Updated 2 months ago
- Inference and training library for high-quality TTS models.β5,581Dec 10, 2024Updated last year
- π€― LobeHub is your Chief Agent Operator, organizing your agents into 7Γ24 operations by hiring, scheduling, and reporting on your entire β¦β78,678Jun 15, 2026Updated last week
- A lightweight coding agent for open models like Deepseek, Kimi, and Qwenβ64,038Updated this week
- Robust Speech Recognition via Large-Scale Weak Supervisionβ103,042Apr 15, 2026Updated 2 months ago
- Open-Sora: Democratizing Efficient Video Production for Allβ29,117Apr 9, 2026Updated 2 months ago
- Industry leading face manipulation platformβ28,919Updated this week
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.β117,348Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.β43,066Updated this week
- Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.β174,493Updated this week
- π OpenHands: AI-Driven Developmentβ77,312Updated this week
- Clone a voice in 5 seconds to generate arbitrary speech in real-timeβ59,917Mar 9, 2026Updated 3 months ago
- real time face swap and one-click video deepfake with only a single imageβ93,898Jun 14, 2026Updated last week
- Foundational model for human-like, expressive TTSβ4,204Jul 30, 2024Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"β14,780May 18, 2026Updated last month
- Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)β72,938Updated this week
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,292Aug 10, 2024Updated last year
- OpenUI let's you describe UI using your imagination, then see it rendered live.β22,404May 20, 2026Updated last month
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β23,377Mar 3, 2026Updated 3 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.β21,667May 25, 2026Updated 3 weeks ago
- Build, run, and manage agent platforms.β40,783Updated this week
- EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engineβ8,478Aug 13, 2024Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ20,840Jun 13, 2026Updated last week
- Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experienceβ61,866Updated this week
- Production-ready platform for agentic workflow development.β145,910Updated this week
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The first real AI developerβ33,740Jun 12, 2026Updated last week
- We write your reusable computer vision tools. πβ44,736Updated this week
- Opiniated RAG for integrating GenAI in your apps π§ Focus on your product rather than the RAG. Easy integration in existing products wiβ¦β39,163Jul 9, 2025Updated 11 months ago
- Universal memory layer for AI Agentsβ58,750Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)β141,711Updated this week
- Foundational Models for State-of-the-Art Speech and Text Translationβ11,803Apr 8, 2026Updated 2 months ago
- PhotoMaker [CVPR 2024]β10,101Oct 31, 2024Updated last year