sophiefy / VITS
ACG Text-to-Speech
☆178 Updated 2 years ago
Alternatives and similar repositories for VITS:
Users who are interested in VITS are comparing it to the repositories listed below
- VITS for Japanese with Whisper as the data processor (you can train your VITS model even if you only have audio) ☆162 Updated last year
- A convenient tool for generating audio files ☆135 Updated 2 years ago
- Tacotron2 implementation for Japanese ☆270 Updated 2 years ago
- Async HTTP process VST plugin ☆162 Updated last year
- GUI for MoeGoe ☆568 Updated last year
- Speech synthesis model and inference GUI repo for galgame characters, based on Tacotron2, Hifigan, VITS and Diff-svc ☆977 Updated last year
- ☆597 Updated 2 years ago
- A GUI for a voice dialogue system that connects a VITS model through the OpenAI API ☆104 Updated 2 years ago
- Fine-Tuning your VITS model using a pre-trained model ☆553 Updated last year
- VITS web UI ☆44 Updated last year
- MoeGoe Android Application by calling Azure function API ☆58 Updated 2 years ago
- VitsWebUi ☆34 Updated 2 years ago
- Extract the voice and corresponding text ☆74 Updated this week
- An auxiliary tool for manual screening of audio dataset. ☆121 Updated last year
- OpenUTAU renderer for diffsinger. Usage: https://github.com/xunmengshe/OpenUtau/wiki/%E4%BD%BF%E7%94%A8%E6%96%B9… ☆23 Updated last year
- An unofficial implementation of the combination of Soft-VC and VITS ☆458 Updated 2 years ago
- ☆390 Updated last year
- Sukasuka anime vocal dataset (すかすかアニメボカロデータセット). 1st anime vocal dataset. Extract audio (vocal) files from video based on .ass subtitle files; manually label vocal files… ☆47 Updated last year
- MoeGoe Azure Cloud Function API ☆51 Updated last year
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform ☆48 Updated 2 years ago
- Deep-learning-based voice changer, supporting local inference. ☆96 Updated 2 years ago
- A Japanese G2P tool based on pyopenjtalk ☆24 Updated 2 years ago
- ☆275 Updated 4 months ago
- Chinese-Japanese Bilingual Text-to-Speech ☆31 Updated 2 years ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech ☆473 Updated 2 years ago