🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
☆1,383Feb 3, 2026Updated last month
Alternatives and similar repositories for Speech-AI-Forge
Users that are interested in Speech-AI-Forge are comparing it to the libraries listed below
Sorting:
- 🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。☆2,577Jul 2, 2024Updated last year
- 官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project☆1,850Jul 3, 2024Updated last year
- ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听 🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview☆709Jul 2, 2024Updated last year
- 一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with su…☆7,518Dec 5, 2025Updated 2 months ago
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆463Nov 7, 2024Updated last year
- ChatTTS资源大全,免费体验地址,音色库等☆1,240Jun 12, 2024Updated last year
- A generative speech model for daily dialogue.☆38,766Jan 18, 2026Updated last month
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆19,786Feb 11, 2026Updated 3 weeks ago
- 使用CHATTTS合成语音,使用FASTAPI作为API服务端,基于GFAST制作了管理系统,提供了音色管理和webui界面☆35Jun 14, 2024Updated last year
- SOTA Open Source TTS☆25,078Feb 2, 2026Updated last month
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆14,122Updated this week
- Multilingual Voice Understanding Model☆7,611Dec 30, 2025Updated 2 months ago
- ☆765Jun 24, 2024Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆55,429Feb 9, 2026Updated 3 weeks ago
- Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.☆5,364Jul 11, 2025Updated 7 months ago
- 实时交互数字人,可自定义形象与音色,支持音色克隆,对话延迟低至3s。Real-time voice interactive digital human, customizable appearance and voice, supporting voice cloning,…☆1,211Dec 18, 2025Updated 2 months ago
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆10,108Dec 12, 2025Updated 2 months ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆15,036Updated this week
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆7,232Dec 24, 2024Updated last year
- 10000 chatTTS voices !chatTTS 音色库,再也不为音色抽卡烦恼啦。这是我第一个项目,熬夜龟速生产10000条音色并上传Github,给点鼓励呗哈!主域名:https://www.TTSlist.com 备用:http://ttslist.aiqb…☆217Jul 18, 2024Updated last year
- Translate the video from one language to another and embed dubbing & subtitles.☆16,324Updated this week
- ☆1,530Jun 14, 2024Updated last year
- [AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning☆4,184Aug 5, 2025Updated 6 months ago
- 利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.☆8,170Jan 9, 2026Updated last month
- An Open-Sourced LLM-empowered Foundation TTS System☆903Sep 28, 2025Updated 5 months ago
- 智能视频多语言AI配音/翻译工具 - Linly-Dubbing — “AI赋能,语言无界”☆3,067Mar 5, 2025Updated 11 months ago
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆9,706May 27, 2025Updated 9 months ago
- Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema☆2,313Aug 1, 2025Updated 7 months ago
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion …☆1,605Aug 15, 2024Updated last year
- EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine☆8,450Aug 13, 2024Updated last year
- 一个超轻量级、可以在移动端实时运行的数字人模型☆2,421Sep 18, 2025Updated 5 months ago
- zero-shot voice conversion & singing voice conversion, with real-time support☆3,601Apr 20, 2025Updated 10 months ago
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆36,025Apr 19, 2025Updated 10 months ago
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…☆10,526Updated this week
- vits2 backbone with multilingual-bert☆8,692Feb 23, 2026Updated last week
- StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.☆1,240Jun 29, 2025Updated 8 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆5,019Jul 2, 2024Updated last year
- CosyVoice在Windows环境下使用的版本☆754Nov 19, 2024Updated last year
- Spark-TTS Inference Code☆10,943Apr 9, 2025Updated 10 months ago