andrewsilva9 / tune_tortoise_autoregressor
Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.
☆15 · Nov 25, 2023 · Updated 2 years ago
Alternatives and similar repositories for tune_tortoise_autoregressor
Users who are interested in tune_tortoise_autoregressor are comparing it to the libraries listed below.
- ☆25 · Mar 12, 2022 · Updated 3 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch. ☆14 · Jun 17, 2021 · Updated 4 years ago
- Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf ☆32 · Jul 6, 2023 · Updated 2 years ago
- WIP Tensorflow implementation of https://github.com/mozilla/TTS ☆15 · Apr 11, 2020 · Updated 5 years ago
- A curated list of Text-to-Video Generation papers and BibTeX entries ☆21 · Feb 21, 2024 · Updated last year
- Parallel waveform generation with DiffusionGAN ☆17 · Mar 26, 2022 · Updated 3 years ago
- ☆19 · Feb 2, 2023 · Updated 3 years ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt… ☆17 · May 24, 2020 · Updated 5 years ago
- 60k hours of phoneme-aligned audio from audio books ☆19 · Jul 27, 2024 · Updated last year
- ☆69 · Mar 31, 2021 · Updated 4 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder. ☆21 · Jul 21, 2021 · Updated 4 years ago
- An AR+AR TTS attempt. ☆18 · Jan 13, 2025 · Updated last year
- Incorporating AutoVocoder to MB-iSTFT-VITS ☆48 · Dec 1, 2022 · Updated 3 years ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow. ☆22 · Dec 26, 2019 · Updated 6 years ago
- Adaptive Vocoder for Custom Voice ☆61 · Sep 22, 2022 · Updated 3 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation. ☆46 · Nov 3, 2021 · Updated 4 years ago
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS). ☆51 · Jun 11, 2024 · Updated last year
- ☆21 · Jun 16, 2021 · Updated 4 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation ☆26 · Dec 12, 2024 · Updated last year
- An extensible speech synthesis system built with PyTorch; the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery ☆26 · Jun 24, 2019 · Updated 6 years ago
- ☆55 · Jan 13, 2023 · Updated 3 years ago
- ☆25 · Jan 24, 2023 · Updated 3 years ago
- Based on https://github.com/fatchord/WaveRNN ☆24 · May 3, 2020 · Updated 5 years ago
- ☆23 · Oct 5, 2017 · Updated 8 years ago
- TTS-frontend with Bert and CRF/lstm (For Tacotron) ☆53 · Jun 2, 2020 · Updated 5 years ago
- ☆23 · Dec 10, 2024 · Updated last year
- CML-TTS: A Multilingual Dataset for Speech Synthesis ☆33 · Jul 31, 2024 · Updated last year
- ☆25 · Mar 6, 2024 · Updated last year
- ☆64 · Jan 15, 2024 · Updated 2 years ago
- ☆25 · Apr 24, 2019 · Updated 6 years ago
- Calculation of MCD (dB) between two speech waveforms ☆57 · Sep 26, 2020 · Updated 5 years ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment ☆68 · Jul 5, 2024 · Updated last year
- trying to reproduce suno v3 ☆35 · Jan 29, 2025 · Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11… ☆46 · Jul 2, 2024 · Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing ☆71 · Dec 2, 2022 · Updated 3 years ago
- ☆97 · Jul 6, 2023 · Updated 2 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2 ☆26 · Sep 28, 2020 · Updated 5 years ago
- My vocoder experiments ☆31 · Jul 26, 2025 · Updated 6 months ago
- voice conversion system ☆25 · Jun 10, 2020 · Updated 5 years ago