myshell-ai / OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
β30,574Updated 2 weeks ago
Alternatives and similar repositories for OpenVoice:
Users that are interested in OpenVoice are comparing it to the libraries listed below
- Zero-Shot Speech Editing and Text-to-Speech in the Wildβ8,036Updated 7 months ago
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ37,097Updated 5 months ago
- SOTA Open Source TTSβ18,589Updated last week
- Amphion (/Γ¦mΛfaΙͺΙn/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junioβ¦β8,358Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computerβ26,680Updated this week
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.β5,457Updated last month
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running onβ¦β29,700Updated this week
- Industry leading face manipulation platformβ21,130Updated this week
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.β64,578Updated this week
- Foundational Models for State-of-the-Art Speech and Text Translationβ11,236Updated 2 months ago
- π Text-Prompted Generative Audio Modelβ36,746Updated 5 months ago
- Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.β109,942Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.β30,740Updated this week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"β9,178Updated this week
- EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engineβ7,606Updated 5 months ago
- Self-hosted AI coding assistantβ28,942Updated this week
- Foundational model for human-like, expressive TTSβ3,996Updated 5 months ago
- Inference and training library for high-quality TTS models.β4,939Updated last month
- High-Resolution Image Synthesis with Latent Diffusion Modelsβ39,854Updated 3 months ago
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generationβ9,889Updated last month
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β21,366Updated last week
- Faster Whisper transcription with CTranslate2β13,661Updated 3 weeks ago
- Run your own AI cluster at home with everyday devices π±π» π₯οΈββ19,676Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)β58,941Updated this week
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creatβ¦β24,234Updated this week
- [CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Modelβ10,664Updated 7 months ago
- Port of OpenAI's Whisper model in C/C++β37,107Updated this week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/β7,759Updated 11 months ago
- Enhanced ChatGPT Clone: Features Agents, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI,β¦β20,782Updated this week
- β© Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chatβ¦β21,825Updated this week