netease-youdao / EmotiVoiceView external linksLinks
EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engine
β8,426Aug 13, 2024Updated last year
Alternatives and similar repositories for EmotiVoice
Users that are interested in EmotiVoice are comparing it to the libraries listed below
Sorting:
- SOTA Open Source TTSβ24,863Feb 2, 2026Updated last week
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ44,516Aug 16, 2024Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model.β35,918Apr 19, 2025Updated 9 months ago
- A generative speech model for daily dialogue.β38,696Jan 18, 2026Updated 3 weeks ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.β7,186Dec 24, 2024Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)β54,918Updated this week
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.β19,578Updated this week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,162Aug 10, 2024Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/β7,956Feb 11, 2024Updated 2 years ago
- Amphion (/Γ¦mΛfaΙͺΙn/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junioβ¦β9,687May 27, 2025Updated 8 months ago
- π Text-Prompted Generative Audio Modelβ38,970Aug 19, 2024Updated last year
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activityβ¦β14,891Feb 4, 2026Updated last week
- Inference and training library for high-quality TTS models.β5,528Dec 10, 2024Updated last year
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wildβ7,212Aug 5, 2024Updated last year
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API keyβ9,979Dec 12, 2025Updated 2 months ago
- Foundational Models for State-of-the-Art Speech and Text Translationβ11,745Nov 14, 2024Updated last year
- Draw a mockup and generate html for itβ13,604Jul 26, 2025Updated 6 months ago
- Multilingual Voice Understanding Modelβ7,497Dec 30, 2025Updated last month
- vits2 backbone with multilingual-bertβ8,687Updated this week
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.β9,501Jun 6, 2025Updated 8 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"β14,079Updated this week
- [CVPR 2023] SadTalkerοΌLearning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animationβ13,587Jun 26, 2024Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β22,993Mar 13, 2025Updated 11 months ago
- Industry leading face manipulation platformβ26,787Updated this week
- Question and Answer based on Anything.β13,859Mar 24, 2025Updated 10 months ago
- A sound cloning tool with a web interface, using your voice or any sound to record audio / δΈδΈͺεΈ¦webηι’ηε£°ι³ε ιε·₯ε ·οΌδ½Ώη¨δ½ ηι³θ²ζδ»»ζε£°ι³ζ₯ε½εΆι³ι’β8,908Aug 29, 2025Updated 5 months ago
- Faster Whisper transcription with CTranslate2β20,833Nov 19, 2025Updated 2 months ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wildβ8,465Mar 15, 2025Updated 11 months ago
- Foundational model for human-like, expressive TTSβ4,190Jul 30, 2024Updated last year
- An Open Source text-to-speech system built by inverting Whisper.β4,555Dec 14, 2025Updated 2 months ago
- Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontenβ¦β12,530Jan 27, 2026Updated 2 weeks ago
- PhotoMaker [CVPR 2024]β10,118Oct 31, 2024Updated last year
- Translate the video from one language to another and embed dubbing & subtitles.β16,150Updated this week
- [CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"β10,906Aug 29, 2025Updated 5 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animationβ5,020Jul 2, 2024Updated last year
- The ultimate space for work and life β to find, build, and collaborate with agent teammates that grow with you. We are taking agent harneβ¦β72,187Updated this week
- π· EasyPhoto | Your Smart AI Photo Generator.β5,185Jul 10, 2024Updated last year
- πClone a voice in 5 seconds to generate arbitrary speech in real-timeβ36,876Jan 7, 2026Updated last month
- Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.β5,341Jul 11, 2025Updated 7 months ago