karchkha / MelSpec_GPT_VQVAELinks
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Updated last year
Alternatives and similar repositories for MelSpec_GPT_VQVAE
Users that are interested in MelSpec_GPT_VQVAE are comparing it to the libraries listed below
Sorting:
- 60k hours of phoneme-aligned audio from audio books☆18Updated 11 months ago
- ☆15Updated 4 years ago
- ☆11Updated 3 years ago
- ☆41Updated 2 years ago
- ☆16Updated 3 years ago
- Temporary anonymous version☆22Updated last year
- ☆25Updated 3 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 3 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Updated 4 years ago
- Digital Speech Processing in PyTorch.☆14Updated 2 years ago
- ☆13Updated 3 years ago
- Based on https://github.com/fatchord/WaveRNN☆24Updated 5 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆27Updated last year
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Updated 2 years ago
- ☆25Updated 2 years ago
- ☆19Updated 2 years ago
- Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…☆33Updated last week
- Alignment examples for Interspeech 2024☆22Updated last year
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 3 years ago
- TTS Text Analyzer☆32Updated last year
- ☆19Updated last year
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge