sophiefy / VITS
ACG Text-to-Speech
☆175Nov 9, 2022Updated 3 years ago
Alternatives and similar repositories for VITS
Users who are interested in VITS are comparing it to the repositories listed below
- Speech synthesis model/inference GUI repo for galgame characters based on Tacotron2, Hifigan, VITS and Diff-svc☆995Mar 3, 2023Updated 2 years ago
- An unofficial implementation of the combination of Soft-VC and VITS☆456Nov 13, 2022Updated 3 years ago
- Chinese-Japanese Bilingual Text-to-Speech☆32Aug 30, 2022Updated 3 years ago
- Executable file for VITS inference☆2,412Aug 22, 2023Updated 2 years ago
- VITS Japanese with Whisper as data processor (you can train your VITS even if you only have audio)☆161May 7, 2023Updated 2 years ago
- Emotion-controllable speech synthesis model that requires no emotion labels, based on VITS☆1,396Mar 30, 2023Updated 2 years ago
- ☆625Nov 27, 2022Updated 3 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆468Nov 17, 2022Updated 3 years ago
- VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai☆940Dec 6, 2023Updated 2 years ago
- PJSK-Vits GUI☆103Jul 24, 2025Updated 6 months ago
- Speaker embedding for anime speech domain based on ECAPA_TDNN☆16Jun 22, 2025Updated 7 months ago
- ☆13Mar 11, 2025Updated 11 months ago
- Singing voice conversion based on Whisper, and LoRA for singing voice cloning☆648Nov 3, 2023Updated 2 years ago
- ☆60Jan 8, 2025Updated last year
- ☆25Jan 24, 2023Updated 3 years ago
- Soft speech units for voice conversion☆456Mar 14, 2024Updated last year
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆280Jul 16, 2023Updated 2 years ago
- Fine-Tuning your VITS model using a pre-trained model☆551May 2, 2023Updated 2 years ago
- Convert Korean to Katakana☆13Dec 13, 2023Updated 2 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64May 30, 2023Updated 2 years ago
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- The better web ui for MOE-TTS☆24Nov 11, 2023Updated 2 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 5 months ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- GUI for MoeGoe☆572Aug 22, 2023Updated 2 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 6 months ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆35Feb 11, 2025Updated last year
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆28Jan 9, 2026Updated last month
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Dec 5, 2022Updated 3 years ago
- A third-party singing voice dataset of 泠鸢yousa☆17Nov 28, 2023Updated 2 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆31May 22, 2025Updated 8 months ago
- text to speech using autoregressive transformer and VITS☆249Apr 3, 2024Updated last year
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Multilin…☆70Nov 21, 2022Updated 3 years ago
- ☆31Jul 13, 2023Updated 2 years ago
- The repository contains scripts and merge scripts that have been modified to adapt an Alpaca-Lora adapter for LoRA tuning when assuming t…☆18May 24, 2023Updated 2 years ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago