thepowerfuldeez / rvc-trainerLinks
☆10Updated last year
Alternatives and similar repositories for rvc-trainer
Users that are interested in rvc-trainer are comparing it to the libraries listed below
Sorting:
- Run Retrieval-based Voice Conversion training and inference with ease.☆11Updated 10 months ago
- RVC Onnx Infer- Upgraded and simplified-ish☆25Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆45Updated 2 months ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆155Updated 7 months ago
- DiffSinger training colab notebook to make training easier hopefully☆49Updated 4 months ago
- Your one-stop solution for voice dataset creation☆127Updated last year
- Implementation of Emo-StarGAN☆45Updated last year
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆80Updated last year
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆85Updated 4 months ago
- Codename's rvc fork version 4, based on Applio.☆29Updated this week
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆85Updated last year
- SoTA open-source TTS☆114Updated 5 months ago
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 9 months ago
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆160Updated 2 years ago
- High quality text-to-speech based on StyleTTS 2.☆70Updated 3 weeks ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆69Updated 2 years ago
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆51Updated 2 weeks ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆128Updated 3 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated 2 weeks ago
- StyleTTS 2 Optimized Training Fork☆34Updated 9 months ago
- ☆24Updated 6 months ago
- AudioSR-Upsampling (any -> 48kHz)☆43Updated last year
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆186Updated last year
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆89Updated 10 months ago
- Create training data for training a voice cloner for bark text to speech.☆47Updated 2 years ago
- VALL-E 2 reproduction☆132Updated last year
- Community framework for training tortoise☆44Updated 3 years ago
- Zero-Shot Emotion Style Transfer☆49Updated 7 months ago
- a Frontier Japanese Speech Generation net☆57Updated 6 months ago