nadare881 / voras-webui-beta
liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project
☆33Updated last year
Alternatives and similar repositories for voras-webui-beta:
Users who are interested in voras-webui-beta are comparing it to the libraries listed below.
- An unofficial vits2-TTS implementation in PyTorch, adapted for 44100 Hz Japanese audio sources.☆23Updated last year
- A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singin…☆32Updated 6 months ago
- a Frontier Japanese Speech Generation net☆19Updated 2 weeks ago
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆52Updated last year
- A Gradio WebUI for training and speech synthesis with Japanese TTS (VITS)☆42Updated last year
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.☆67Updated last year
- Misc. tools/scripts that I made to use for tortoise☆22Updated 6 months ago
- DiffSinger training colab notebook to make training easier hopefully☆38Updated last month
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆141Updated last year
- ☆26Updated 7 months ago
- List of repositories relevant to VITS.☆36Updated 2 years ago
- RVC Onnx Infer- Upgraded and simplified-ish☆21Updated 9 months ago
- VITS2 using Phoneme-Level Japanese BERT☆13Updated last year
- RTVC: Real-Time Voice Conversion GUI☆53Updated last year
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆31Updated last month
- ☆13Updated 8 months ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion, adapted for 44100 Hz Japanese HuBERT.☆14Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆68Updated last year
- Ultimate Vocal Remover CLI type for Google Colab☆50Updated last month
- RVC Inference with multiple model and huggingface support☆103Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorch☆122Updated 3 months ago
- Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model☆10Updated 2 years ago
- Adapted for 44100 Hz Japanese audio sources: MB-iSTFT-VITS: Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Tim…☆36Updated last year
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor, adapted for 44100 Hz Japanese audio sources.☆19Updated last year
- Real-time end-to-end singing voice conversion☆19Updated 4 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆99Updated last month
- ☆25Updated 11 months ago
- VITS with phoneme-level prosody modeling based on MaskGIT☆81Updated 6 months ago
- ☆28Updated last year
- SOFA: Singing-Oriented Forced Aligner☆149Updated 2 weeks ago