A PyTorch implementation of the universal neural vocoder
☆67Nov 6, 2020Updated 5 years ago
Alternatives and similar repositories for universal-vocoder
Users that are interested in universal-vocoder are comparing it to the libraries listed below
Sorting:
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention☆203Nov 30, 2020Updated 5 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Nov 6, 2020Updated 5 years ago
- ☆100Jul 22, 2021Updated 4 years ago
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…☆117May 27, 2021Updated 4 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- ☆80Aug 8, 2025Updated 6 months ago
- ☆10Apr 8, 2024Updated last year
- ☆22Aug 21, 2020Updated 5 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64May 30, 2023Updated 2 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- ☆11May 7, 2022Updated 3 years ago
- ☆46Apr 16, 2023Updated 2 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- ☆26Jun 5, 2024Updated last year
- A unified model for zero-shot singing voice conversion and synthesis☆22Nov 30, 2022Updated 3 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆286Jan 8, 2024Updated 2 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- Collect Voice Conversion researches☆96Updated this week
- TTS Text Analyzer☆32Jul 20, 2023Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- Demo for 2022 ICASSP☆64Jun 14, 2022Updated 3 years ago
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆23Aug 14, 2025Updated 6 months ago
- **Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…☆101Apr 10, 2025Updated 10 months ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- ☆19Mar 22, 2024Updated last year
- Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.☆194Jun 8, 2023Updated 2 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆134Nov 29, 2023Updated 2 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- ☆15Sep 9, 2021Updated 4 years ago
- My vocoder experiments☆31Jul 26, 2025Updated 7 months ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆130Jun 11, 2024Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- Temporary anonymous version☆22Mar 20, 2024Updated last year
- The official repository for Audio ALBERT☆67Jan 21, 2022Updated 4 years ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆69Aug 3, 2021Updated 4 years ago