jjihwan / Voice-CloningLinks
Simple, Unified Repository for Retrieval-based Voice Conversion
☆17Updated last year
Alternatives and similar repositories for Voice-Cloning
Users that are interested in Voice-Cloning are comparing it to the libraries listed below
Sorting:
- Run Retrieval-based Voice Conversion training and inference with ease.☆11Updated 8 months ago
- Multilingual-Speech-Synthesis-Voice-Conversion Using Bark + RVC☆14Updated 5 months ago
- ☆10Updated 2 years ago
- Enabling the use of multiple modalities while prompting Stable Diffusion☆14Updated 3 years ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Updated last year
- ☆10Updated last year
- Diffusion Model for Voice Conversion☆17Updated 3 years ago
- RVC Onnx Infer- Upgraded and simplified-ish☆22Updated last year
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆38Updated 3 weeks ago
- ☆12Updated last year
- ☆15Updated last year
- ☆20Updated last year
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆18Updated 8 months ago
- Talking head animation☆27Updated last year
- Talking Face Generation system☆19Updated 2 years ago
- A curated list of resources in audio visual question answering and related area. :-)☆14Updated 3 months ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆17Updated last month
- ☆14Updated last year
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆25Updated last year
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆19Updated last month
- ☆39Updated 3 months ago
- Make any person bald!! Component of the paper: Learning to regulate 3D head shape by removing occluding hair from in-the-wild images.☆11Updated 3 years ago
- Codebase and project page for EDMSound☆35Updated last year
- Official repository of Tapir Lab.'s Lip-Sync Method☆10Updated 2 years ago
- My vocoder experiments☆31Updated 2 months ago
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆46Updated last year
- Music production for silent film clips.☆28Updated 5 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆12Updated 10 months ago
- A 28× Compressed Wav2Lip for Efficient Talking Face Generation [ICCV'23 Demo] [MLSys'23 Workshop] [NVIDIA GTC'23]☆57Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆26Updated 7 months ago