esnya / hf-rvc
Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.
☆68 · Updated last year
Alternatives and similar repositories for hf-rvc:
Users who are interested in hf-rvc are comparing it to the repositories listed below.
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project ☆33 · Updated last year
- Misc. tools/scripts that I made to use for tortoise ☆21 · Updated 8 months ago
- RVC Onnx Infer - Upgraded and simplified-ish ☆22 · Updated last year
- Advanced RVC Inference for quicker and effortless model downloads ☆50 · Updated last month
- RTVC: Real-Time Voice Conversion GUI ☆55 · Updated last year
- RVC Inference with multiple models and huggingface support ☆104 · Updated last year
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei ☆144 · Updated last year
- Your one-stop solution for voice dataset creation ☆119 · Updated last year
- Ultimate Vocal Remover CLI type for Google Colab ☆52 · Updated last week
- ☆17 · Updated 11 months ago
- Faster Tortoise inference than Tortoise Fast Fork ☆128 · Updated last year
- High performance RVC inferencing, intended for multiple instances in memory at once. Also includes the latest pitch estimator RMVPE, Pyth… ☆26 · Updated last year
- DiffSinger training colab notebook to make training easier hopefully ☆42 · Updated this week
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion ☆176 · Updated 7 months ago
- Audio datasets, easier. ☆84 · Updated last year
- ☆99 · Updated 8 months ago
- Text to Speech using Coqui TTS + RVC ☆101 · Updated last year
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion ☆238 · Updated last year
- A collection of neural vocoders suitable for singing voice synthesis tasks. ☆122 · Updated last month
- TorToiSe fine-tuning with DLAS ☆220 · Updated 9 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io ☆67 · Updated last year
- List of repositories relevant to VITS. ☆36 · Updated 2 years ago
- Singing Voice Synthesis based on VITS, different from VISinger ☆190 · Updated last year
- GradioUI for TortoiseTTS voice generation ☆34 · Updated last year
- ☆255 · Updated last year
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices ☆20 · Updated last year
- ☆66 · Updated last year
- Fine-tuning MusicGen without prompts to generate music with a specific style ☆63 · Updated last year
- ☆134 · Updated 2 months ago
- Speech AI training and inference tools ☆36 · Updated last year