myshell-ai / OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆32,029Updated 2 weeks ago
Alternatives and similar repositories for OpenVoice:
Users that are interested in OpenVoice are comparing it to the libraries listed below
- 🔊 Text-Prompted Generative Audio Model☆37,629Updated 8 months ago
- SOTA Open Source TTS☆20,921Updated 3 weeks ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆6,000Updated 4 months ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆8,253Updated last month
- EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine☆7,941Updated 8 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆39,745Updated 8 months ago
- Industry leading face manipulation platform☆22,700Updated last week
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆9,017Updated 3 weeks ago
- A server software for Minecraft: Bedrock Edition in PHP☆2Updated 4 years ago
- Inference and training library for high-quality TTS models.☆5,219Updated 4 months ago
- Self-hosted AI coding assistant☆31,021Updated this week
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13,515Updated this week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆11,659Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆17,307Updated this week
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,500Updated 5 months ago
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆75,825Updated this week
- OpenUI let's you describe UI using your imagination, then see it rendered live.☆21,061Updated 2 weeks ago
- 🙌 OpenHands: Code Less, Make More☆53,709Updated this week
- Foundational model for human-like, expressive TTS☆4,104Updated 9 months ago
- Python scraper based on AI☆19,425Updated last week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆43,502Updated this week
- A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频☆8,467Updated 4 months ago
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆37,357Updated this week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆5,686Updated 8 months ago
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆8,120Updated this week
- A Gradio web UI for Large Language Models with support for multiple inference backends.☆43,430Updated this week
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆21,917Updated last month
- Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, …☆25,188Updated this week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/☆7,856Updated last year
- Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powere…☆21,069Updated last week