netease-youdao / EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
☆7,201Updated last month
Related projects: ⓘ
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆4,398Updated last month
- Brand new TTS solution☆11,190Updated this week
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆6,376Updated last month
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆5,259Updated 2 months ago
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆4,450Updated last week
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆11,637Updated 2 months ago
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆8,881Updated last month
- A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频☆7,202Updated 3 weeks ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆5,939Updated this week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/☆7,553Updated 7 months ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆7,459Updated 2 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆4,768Updated last week
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆10,850Updated 2 months ago
- FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU neede…☆8,451Updated 5 months ago
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,216Updated 2 months ago
- Foundational Models for State-of-the-Art Speech and Text Translation☆10,755Updated last month
- A generative speech model for daily dialogue.☆30,703Updated 2 weeks ago
- Inference and training library for high-quality TTS models.☆4,193Updated last month
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,498Updated 2 months ago
- Question and Answer based on Anything.☆11,376Updated this week
- 利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.☆16,112Updated last month
- Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.☆3,267Updated 3 weeks ago
- Instant voice cloning by MIT and MyShell.☆28,390Updated 3 weeks ago
- Faster Whisper transcription with CTranslate2☆11,378Updated 3 weeks ago
- [CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model☆10,359Updated 2 months ago
- A free and open-source inpainting & image-upscaling tool powered by webgpu and wasm on the browser。| 基于 Webgpu 技术和 wasm 技术的免费开源 inpaint…☆4,864Updated 2 months ago
- FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…☆16,959Updated this week
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆7,267Updated this week
- 🔊 Text-Prompted Generative Audio Model☆35,297Updated last month
- Next generation face swapper and enhancer☆17,808Updated this week