kamepong / ACVAE-VC
☆10Updated 3 years ago
Alternatives and similar repositories for ACVAE-VC
Users that are interested in ACVAE-VC are comparing it to the libraries listed below
Sorting:
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆36Updated last year
- ☆11Updated 2 years ago
- A neural speech codec based on discrete WavLM representations☆24Updated 8 months ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆53Updated 6 months ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆24Updated last year
- ☆17Updated 10 months ago
- with alignment learning and continuous wavelet transform☆21Updated 2 years ago
- Streaming Vocos☆24Updated 4 months ago
- ☆61Updated last year
- ☆18Updated last year
- Spherical residual vector quantization (SRVQ)☆28Updated 8 months ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆46Updated last month
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Updated 4 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆20Updated last month
- Implementation of Emo-StarGAN☆45Updated last year
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆63Updated 9 months ago
- ☆47Updated last month
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated 2 years ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆19Updated last year
- ☆50Updated 2 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆14Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆44Updated 11 months ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated last year
- Self-supervised Generative LM-based Voice Conversion☆36Updated 3 weeks ago
- ☆33Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".☆25Updated last year
- SRTNet☆24Updated 2 years ago