fishaudio / Bert-VITS2
vits2 backbone with multilingual-bert
☆7,972Updated this week
Related projects ⓘ
Alternatives and complementary repositories for Bert-VITS2
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆4,744Updated 4 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆35,258Updated this week
- Bark Voice Cloning and Voice Cloning for Chinese Speech☆2,779Updated 3 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆6,138Updated this week
- 一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with su…☆6,186Updated 2 months ago
- Easily train a good VC model with voice data <= 10 mins!☆24,296Updated 2 months ago
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆2,653Updated 6 months ago
- Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)☆1,873Updated last week
- SoftVC VITS Singing Voice Conversion☆25,827Updated 11 months ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆6,829Updated 11 months ago
- AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/…☆3,046Updated this week
- So-VITS-SVC 本地部署/训练/推理/使用帮助文档 So-VITS-SVC Local Deployment/Training/Inference/Usage Help Document☆672Updated 3 months ago
- 无需情感标注的情感可控语音合成模型,基于VITS☆1,326Updated last year
- An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singi…☆2,711Updated this week
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆6,189Updated last week
- Brand new TTS solution☆14,138Updated this week
- A generative speech model for daily dialogue.☆32,179Updated this week
- LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.☆4,578Updated last week
- A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频☆7,455Updated 2 months ago
- A simple GUI application that slices audio with silence detection☆1,235Updated 3 months ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆4,303Updated last year
- Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.☆3,615Updated 2 months ago
- Yet another voice assistant, but alive.☆2,464Updated 11 months ago
- Multilingual Voice Understanding Model☆3,349Updated 3 weeks ago
- A simple VITS HTTP API, developed by extending Moegoe with additional features.☆810Updated last week
- 官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project☆1,196Updated 4 months ago
- Genshin Datasets For SVC/SVS/TTS☆592Updated last month
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆6,853Updated this week
- Executable file for VITS inference☆2,349Updated last year
- a TTS demo for training new characters.☆437Updated 10 months ago