jjihwan / Voice-CloningLinks
Simple, Unified Repository for Retrieval-based Voice Conversion
☆17Updated last year
Alternatives and similar repositories for Voice-Cloning
Users that are interested in Voice-Cloning are comparing it to the libraries listed below
Sorting:
- Run Retrieval-based Voice Conversion training and inference with ease.☆11Updated last year
- ☆10Updated 2 years ago
- ☆11Updated last year
- Diffusion Model for Voice Conversion☆17Updated 3 years ago
- Multilingual-Speech-Synthesis-Voice-Conversion Using Bark + RVC☆14Updated 9 months ago
- ☆20Updated last year
- ☆14Updated last year
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Updated 2 years ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆26Updated last year
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Updated 2 years ago
- Enabling the use of multiple modalities while prompting Stable Diffusion☆15Updated 3 years ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Updated last week
- ☆14Updated 2 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆27Updated 2 years ago
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 11 months ago
- ☆24Updated 8 months ago
- ☆41Updated 6 months ago
- AudioBERT 📢 : Audio Knowledge Augmented Language Model (ICASSP 2025)☆41Updated 11 months ago
- RVC Onnx Infer- Upgraded and simplified-ish☆25Updated last year
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆19Updated 11 months ago
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues☆86Updated 3 weeks ago
- Indic-Conformer models for ASR☆20Updated last year
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆46Updated 4 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Updated last year
- This project predicts wind turbine failure using numerous sensor data by applying classification based ML models that improves prediction…☆10Updated 2 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Updated 5 months ago
- An official implementation of Style-Talker for Spoken Dialogue Generation☆23Updated last year
- My vocoder experiments☆31Updated 6 months ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Updated 2 years ago
- Music production for silent film clips.☆32Updated 9 months ago