revsic / torch-retriever-vcView external linksLinks
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
Alternatives and similar repositories for torch-retriever-vc
Users that are interested in torch-retriever-vc are comparing it to the libraries listed below
Sorting:
- ICASSP2022 TTS&VC Summary☆14Jun 9, 2022Updated 3 years ago
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆40Jul 17, 2021Updated 4 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in Tensorflow.☆27Oct 30, 2018Updated 7 years ago
- ☆25Apr 24, 2019Updated 6 years ago
- ☆15May 8, 2021Updated 4 years ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Oct 19, 2022Updated 3 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- ☆17Aug 27, 2025Updated 5 months ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- ☆64May 23, 2022Updated 3 years ago
- ☆10Apr 8, 2024Updated last year
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 2 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆33Jul 31, 2024Updated last year
- TTS前,文本标准化,将数字字母处理转化为汉字☆12Apr 27, 2024Updated last year
- ☆14Aug 16, 2023Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- ☆22Jul 30, 2025Updated 6 months ago
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- ☆11May 7, 2022Updated 3 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago
- visual-text to speech☆14Apr 3, 2022Updated 3 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Feb 20, 2018Updated 7 years ago
- A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack☆10Feb 19, 2018Updated 7 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2☆26Sep 28, 2020Updated 5 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆32Feb 27, 2021Updated 4 years ago
- SelfRemaster: SSL Speech Restoration☆94Jan 5, 2024Updated 2 years ago
- Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing☆89Sep 6, 2024Updated last year
- BigVGAN with Neural Source-Filter☆56Sep 21, 2023Updated 2 years ago
- Util code, issues, discussions☆29Aug 31, 2018Updated 7 years ago
- ☆33Jun 29, 2023Updated 2 years ago
- ☆55Aug 11, 2022Updated 3 years ago