ciaua / score_lyrics_free_svg
Score- and Lyrics-Free Singing Voice Generation
☆28 Updated 4 years ago
Alternatives and similar repositories for score_lyrics_free_svg
Users who are interested in score_lyrics_free_svg are comparing it to the repositories listed below
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems. ☆29 Updated last year
- ☆26 Updated 4 years ago
- Deep Performer: Score-to-audio music performance synthesis ☆43 Updated last year
- A unified model for zero-shot singing voice conversion and synthesis ☆22 Updated 2 years ago
- Audio generation model working with GPT-2 and a VQVAE-compressed representation of mel spectrograms ☆18 Updated last year
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023 ☆23 Updated last year
- Based on https://github.com/fatchord/WaveRNN ☆24 Updated 5 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106 ☆46 Updated 4 years ago
- ICASSP 2022 ☆61 Updated 3 years ago
- ☆16 Updated 3 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020) ☆50 Updated 4 years ago
- 60k hours of phoneme-aligned audio from audio books ☆18 Updated 9 months ago
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS) ☆22 Updated 3 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models ☆31 Updated 2 years ago
- ☆15 Updated 2 years ago
- Temporary anonymous version ☆22 Updated last year
- Alignment examples for Interspeech 2024 ☆20 Updated 10 months ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation ☆39 Updated 4 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System ☆35 Updated 4 years ago
- WaveNet implementation using tf.estimator ☆21 Updated last year
- ☆10 Updated last year
- ☆34 Updated 5 years ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech" ☆15 Updated last year
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for… ☆43 Updated 4 years ago
- A singing voice database created by having 22 people each sing five children's songs. ☆13 Updated 2 years ago
- ☆19 Updated 2 years ago
- with alignment learning and continuous wavelet transform ☆21 Updated 2 years ago
- ☆41 Updated 2 years ago
- Project for MIDI to Audio Synthesis ☆23 Updated 2 years ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesis, inspired by Tacotron & FastSpeech2 & RefineGAN. ☆15 Updated 2 years ago