Edresson / TTS
Deep learning for Text to Speech
☆27Updated 4 years ago
Alternatives and similar repositories for TTS
Users that are interested in TTS are comparing it to the libraries listed below
Sorting:
- Open Source Text-To-Speech Portuguese Dataset☆166Updated last year
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆143Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆120Updated 2 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 6 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Updated 2 years ago
- A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.☆67Updated 5 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆123Updated 2 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆105Updated 3 years ago
- A sequence-to-sequence voice conversion toolkit.☆97Updated 10 months ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆167Updated last year
- Python toolkit for speech processing☆68Updated last week
- ☆80Updated 11 months ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Updated 9 months ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆90Updated 4 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- Wav2vec resources and models for Brazilian Portuguese☆33Updated 2 years ago
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆85Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Updated 5 years ago
- Utility tools for the "Divide and Remaster" dataset, introduced as part of the Cocktail Fork problem paper: https://arxiv.org/abs/2110.09…☆72Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆83Updated last year
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".☆34Updated 4 years ago
- ☆39Updated last year
- PyTorch Implementation of Multi-Singer (ACM-MM'21)☆138Updated 3 years ago
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆99Updated 3 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆154Updated 3 years ago
- MeetEval - A meeting transcription evaluation toolkit☆96Updated this week
- VAE Tacotron 2, an alternative of GST Tacotron☆88Updated last year