jjihwan / Voice-Cloning
Simple, Unified Repository for Retrieval-based Voice Conversion
☆17 Updated last year
Alternatives and similar repositories for Voice-Cloning
Users who are interested in Voice-Cloning are comparing it to the libraries listed below
- ☆10 Updated last year
- Run Retrieval-based Voice Conversion training and inference with ease. ☆11 Updated 7 months ago
- ☆10 Updated last year
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation… ☆13 Updated last year
- A curated list of resources in audio visual question answering and related area. :-) ☆12 Updated 2 months ago
- ☆15 Updated last year
- Multilingual-Speech-Synthesis-Voice-Conversion Using Bark + RVC ☆14 Updated 4 months ago
- Enabling the use of multiple modalities while prompting Stable Diffusion ☆15 Updated 2 years ago
- ☆11 Updated last year
- ☆37 Updated last month
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a… ☆12 Updated 2 years ago
- speaker-disentangled speech linguistic content quantizer ☆22 Updated 5 months ago
- RVC Onnx Infer- Upgraded and simplified-ish ☆22 Updated last year
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing ☆25 Updated last year
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues ☆78 Updated 2 months ago
- ☆14 Updated 2 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech ☆22 Updated last year
- Music production for silent film clips. ☆27 Updated 4 months ago
- Diffusion Model for Voice Conversion ☆17 Updated 2 years ago
- ☆14 Updated 2 years ago
- My vocoder experiments ☆31 Updated last month
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech ☆11 Updated 2 years ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech ☆18 Updated 6 months ago
- This is a winter of code project aimed at speech enhancement of text to speech models. ☆24 Updated 3 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ☆26 Updated 2 years ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions" ☆30 Updated last week
- Misc. tools/scripts that I made to use for tortoise ☆21 Updated last year
- Talking head animation ☆27 Updated last year
- An official implementation of Style-Talker for Spoken Dialogue Generation ☆22 Updated 7 months ago
- ☆14 Updated last year