Contains code for our work on speech to singing conversion (ICASSP 2020)
☆50Oct 27, 2020Updated 5 years ago
Alternatives and similar repositories for sp2si-code
Users that are interested in sp2si-code are comparing it to the libraries listed below
Sorting:
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- ☆18Feb 9, 2020Updated 6 years ago
- Implementation of the AlignTTS☆77Jul 6, 2023Updated 2 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- Multi-voice singing voice synthesis☆238Mar 24, 2023Updated 2 years ago
- Interspeech 2019 tutorial materials☆49Sep 26, 2019Updated 6 years ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- ☆225Dec 29, 2022Updated 3 years ago
- FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation☆29Dec 19, 2024Updated last year
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- Code for ISMIR 2020 paper: "Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks"☆55Nov 20, 2024Updated last year
- Code for Unconditional Audio Generation with GAN and Cycle Regularization☆77Nov 11, 2021Updated 4 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42May 29, 2019Updated 6 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Mar 29, 2022Updated 3 years ago
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Oct 13, 2022Updated 3 years ago
- FVN is now obsolete. Please use CAPRICEP instead. I will stop updating this tool. Frequency domain variants of Velvet Noise, a flexible b…☆38Aug 12, 2020Updated 5 years ago
- Singing synthesis from MIDI file☆284Jul 20, 2022Updated 3 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- A system works on singing voice synthesis☆79Jan 11, 2023Updated 3 years ago
- parallel wavenet based on nsynth☆107Dec 14, 2018Updated 7 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- ☆46Apr 16, 2023Updated 2 years ago
- Code to train and run Blow☆145Sep 4, 2019Updated 6 years ago
- DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes.☆380Jun 11, 2020Updated 5 years ago
- Lightweight speaker anonymization [IEEE SLT2021]☆27Jun 6, 2022Updated 3 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55May 6, 2020Updated 5 years ago
- ☆19Feb 2, 2023Updated 3 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- ☆18Jul 31, 2019Updated 6 years ago
- Singing Voice Synthesis based on VITS, different from VISinger☆196Nov 13, 2023Updated 2 years ago
- A pytroch implementation of the FB-MelGAN☆90May 26, 2020Updated 5 years ago