Data processing tools for preparing speech and labels for training TTS voices
☆29Aug 13, 2020Updated 5 years ago
Alternatives and similar repositories for tts_data_tools
Users that are interested in tts_data_tools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…☆24Dec 8, 2019Updated 6 years ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…☆17May 24, 2020Updated 5 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Jun 24, 2019Updated 6 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15May 30, 2021Updated 4 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆247Oct 30, 2019Updated 6 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Mar 31, 2019Updated 6 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- readers that enable reading kaldi ark in tensorflow☆17Mar 7, 2018Updated 8 years ago
- Util code, issues, discussions☆29Aug 31, 2018Updated 7 years ago
- A collection of examples demonstrating how we can build speech synthesis systems using nnmnkwii.☆71May 15, 2020Updated 5 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- ICASSP2022 TTS&VC Summary☆14Jun 9, 2022Updated 3 years ago
- Speech waveform synthesis filters☆13Jul 21, 2017Updated 8 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- High-level API for tar-based dataset☆12Feb 3, 2024Updated 2 years ago
- Neural Network Semantic Parser for Almond☆15Apr 11, 2019Updated 6 years ago
- ☆19Mar 22, 2024Updated 2 years ago
- A Pytorch implementation of WaveVAE ("Parallel Neural Text-to-Speech")☆126Feb 24, 2024Updated 2 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Aug 6, 2020Updated 5 years ago
- Instructions for reproducing the research described in the paper "Tempo Estimation for Music Loops and a Simple Confidence Measure"☆14Nov 18, 2016Updated 9 years ago
- Interspeech 2019 tutorial materials☆49Sep 26, 2019Updated 6 years ago
- TTS model based on Transformer.☆58Aug 2, 2019Updated 6 years ago
- A step-by-step problem set for implementing a high-quality deep dependency parser in Pytorch☆15Aug 12, 2017Updated 8 years ago
- Deep Learning For Ultrasound Tongue Imaging☆12Dec 17, 2024Updated last year
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- ☆42Oct 30, 2018Updated 7 years ago
- Simple automatic speech recognition system based on digits corpora (Polish language), created in Kaldi toolkit. Despite of the language d…☆11May 29, 2016Updated 9 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Jun 22, 2022Updated 3 years ago
- A python implementation of the Griffin Lim Algorithm for audio reconstruction from magnitudes☆34Jan 17, 2024Updated 2 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- ☆51Feb 15, 2019Updated 7 years ago
- Code for TALLIP2019 paper "µ-Forcing: Training Variational Recurrent Autoencoders for Text Generation"☆12May 27, 2019Updated 6 years ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Dec 31, 2023Updated 2 years ago
- Deep understanding and modelling of the hierarchical structure of prosody☆24May 12, 2019Updated 6 years ago