Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.
☆15Nov 25, 2023Updated 2 years ago
Alternatives and similar repositories for tune_tortoise_autoregressor
Users that are interested in tune_tortoise_autoregressor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆25Mar 12, 2022Updated 4 years ago
- End-to-End Probabilistic Inference for Nonstationary Audio Analysis☆12Aug 7, 2019Updated 6 years ago
- Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf☆32Jul 6, 2023Updated 2 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Jun 17, 2021Updated 4 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Apr 11, 2020Updated 5 years ago
- ☆10Sep 18, 2017Updated 8 years ago
- ☆19Feb 2, 2023Updated 3 years ago
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 4 years ago
- ☆69Mar 31, 2021Updated 4 years ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…☆17May 24, 2020Updated 5 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).☆51Jun 11, 2024Updated last year
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆26Dec 12, 2024Updated last year
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆45Nov 3, 2021Updated 4 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆34Jul 31, 2024Updated last year
- ☆25Jan 24, 2023Updated 3 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Jun 24, 2019Updated 6 years ago
- Multispeaker Community Vocoder Model for DiffSinger☆39Aug 11, 2025Updated 7 months ago
- Senren*Banka Civilization and Leader Murasame Mod for Civilization VI☆11Mar 13, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Jul 2, 2024Updated last year
- ☆21Jun 16, 2021Updated 4 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- TTS-frontend with Bert and CRF/lstm (For Tacotron)☆53Jun 2, 2020Updated 5 years ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Dec 26, 2019Updated 6 years ago
- ☆39Apr 15, 2024Updated last year
- ☆25Mar 6, 2024Updated 2 years ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆130Jun 11, 2024Updated last year
- Generating slit scan images from videos in Python☆12Oct 25, 2017Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆68Jul 5, 2024Updated last year
- ☆31Nov 7, 2018Updated 7 years ago
- Turn your two-bit doodles into fine artworks with deep neural networks, generate seamless textures from photos, transfer style from one i…☆12Apr 11, 2019Updated 6 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- GradioUI for TortoiseTTS voice generation☆33Oct 5, 2023Updated 2 years ago
- i-mae Pytorch Repo☆20Apr 6, 2024Updated last year