myshell-ai / OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆32,401 Updated last month
Alternatives and similar repositories for OpenVoice
Users who are interested in OpenVoice are comparing it to the libraries listed below.
- SOTA Open Source TTS ☆21,227 Updated this week
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆40,319 Updated 9 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. ☆6,099 Updated 5 months ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wild ☆8,270 Updated 2 months ago
- 🔊 Text-Prompted Generative Audio Model ☆37,876 Updated 9 months ago
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. ☆78,376 Updated this week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/ ☆7,867 Updated last year
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio… ☆9,097 Updated this week
- EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine ☆7,992 Updated 9 months ago
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆38,945 Updated this week
- ⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other… ☆26,540 Updated this week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆12,013 Updated last week
- Inference and training library for high-quality TTS models. ☆5,261 Updated 5 months ago
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer ☆29,198 Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆96,527 Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆24,399 Updated 3 weeks ago
- 🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 4 / Gemini / Ollama / DeepSe… ☆61,854 Updated this week
- A generative speech model for daily dialogue. ☆36,396 Updated last week
- A simple screen parsing tool towards pure vision based GUI agent ☆22,258 Updated 2 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. ☆14,104 Updated this week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆5,750 Updated 9 months ago
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone ☆19,471 Updated 2 weeks ago
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆17,473 Updated last week
- Self-hosted AI coding assistant ☆31,253 Updated this week
- A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri… ☆7,345 Updated 3 weeks ago
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆61,918 Updated this week
- Foundational model for human-like, expressive TTS ☆4,129 Updated 10 months ago
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models. ☆141,713 Updated this week
- A sound cloning tool with a web interface, using your voice or any sound to record audio ☆8,547 Updated 5 months ago
- Real-time face swap for PC streaming or video calls ☆28,679 Updated 6 months ago