jjihwan / Voice-CloningLinks
Simple, Unified Repository for Retrieval-based Voice Conversion
☆17Updated last year
Alternatives and similar repositories for Voice-Cloning
Users that are interested in Voice-Cloning are comparing it to the libraries listed below
Sorting:
- Run Retrieval-based Voice Conversion training and inference with ease.☆11Updated 11 months ago
- ☆10Updated 2 years ago
- Enabling the use of multiple modalities while prompting Stable Diffusion☆15Updated 3 years ago
- ☆10Updated last year
- Multilingual-Speech-Synthesis-Voice-Conversion Using Bark + RVC☆14Updated 8 months ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Updated 2 years ago
- Diffusion Model for Voice Conversion☆17Updated 3 years ago
- ☆14Updated 2 years ago
- Music production for silent film clips.☆31Updated 8 months ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆26Updated last year
- Sample and Computation Redistribution for Efficient Face Detection☆15Updated last year
- RVC Onnx Infer- Upgraded and simplified-ish☆25Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Updated last year
- ☆15Updated last year
- Talking Face Generation system☆19Updated 2 years ago
- ☆24Updated 8 months ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆19Updated 11 months ago
- ☆20Updated last year
- AudioBERT 📢 : Audio Knowledge Augmented Language Model (ICASSP 2025)☆41Updated 11 months ago
- Talking head animation☆28Updated 2 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Updated 2 years ago
- ☆14Updated 2 years ago
- ☆15Updated 8 months ago
- ☆28Updated 2 months ago
- Make any person bald!! Component of the paper: Learning to regulate 3D head shape by removing occluding hair from in-the-wild images.☆12Updated 3 years ago
- A curated list of resources in audio visual question answering and related area. :-)☆17Updated 6 months ago
- Zero-Shot Emotion Style Transfer☆49Updated 8 months ago
- ☆14Updated last year
- A Versatile Face Encoder for Zero-Shot Diffusion Model Personalization☆24Updated 5 months ago
- Codebase and project page for EDMSound☆35Updated 2 years ago