a43992899 / DeID-VC
Code for Interspeech2022 paper DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion
☆13Updated last year
Alternatives and similar repositories for DeID-VC:
Users that are interested in DeID-VC are comparing it to the libraries listed below
- Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in Tensorflow.☆27Updated 6 years ago
- SandyPanda-MLDL / -Evaluation-Metrics-Used-For-The-Performance-Evaluation-of-Voice-Conversion-VC-ModelsEvaluation Metrics Used For The Performance Evaluation of Voice Conversion (VC) Models☆16Updated last year
- Voice Alignment and Conversion with Neural Networks and the WORLD codec.☆20Updated 5 years ago
- ☆22Updated 4 years ago
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆28Updated 3 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Updated 4 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated 2 years ago
- PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification☆28Updated 4 years ago
- A toolkit for any-to-any encoder-decoder voice conversion systems☆83Updated last year
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆23Updated 2 years ago
- ☆87Updated 2 years ago
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 3 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆20Updated 3 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆36Updated last year
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Updated last year
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆22Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆42Updated 4 years ago
- Yin pitch estimator in PyTorch☆114Updated 2 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆75Updated 3 months ago
- ☆30Updated 5 months ago
- ☆45Updated 2 years ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- RepVgg + HiFiGAN☆34Updated 2 years ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year
- Objective metrics used in several text-to-speech (TTS) papers.☆48Updated 3 years ago
- GAN series for voice conversion on VCC2018 dataset☆16Updated 4 years ago
- ☆65Updated last year
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Updated 4 years ago