ZackHodari/tts_data_tools

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZackHodari/tts_data_tools)

ZackHodari / tts_data_tools

Data processing tools for preparing speech and labels for training TTS voices

☆29

Alternatives and similar repositories for tts_data_tools

Users that are interested in tts_data_tools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZackHodari / average_prosody
View on GitHub
Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…
☆24Dec 8, 2019Updated 6 years ago
huiw39 / ExtensibleTTS-PyTorch
View on GitHub
An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery
☆26Jun 24, 2019Updated 7 years ago
r9y9 / icassp2020-espnet-tts-merlin-baseline
View on GitHub
ICASSP 2020 ESPnet-TTS: Merlin baseline system
☆37Oct 28, 2019Updated 6 years ago
espnet / icassp2020-tts
View on GitHub
ESPnet-TTS Audio Sample HP
☆21Oct 25, 2019Updated 6 years ago
tachi-hi / tts_samples
View on GitHub
Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…
☆15May 30, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
WelkinYang / EMPHASIS-pytorch
View on GitHub
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
☆15Mar 31, 2019Updated 7 years ago
Helsinki-NLP / prosody
View on GitHub
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
☆250Oct 30, 2019Updated 6 years ago
hhguo / WaveRNN
View on GitHub
Based on https://github.com/fatchord/WaveRNN
☆24May 3, 2020Updated 6 years ago
ronggong / MIREX-2018-Automatic-Lyrics-to-Audio-Alignment
View on GitHub
Util code, issues, discussions
☆29Aug 31, 2018Updated 7 years ago
ljuvela / multiscale-GAN
View on GitHub
Code for ICASSP 2019 paper
☆18Oct 29, 2018Updated 7 years ago
r9y9 / SynthesisFilters.jl
View on GitHub
Speech waveform synthesis filters
☆13Jul 21, 2017Updated 9 years ago
r9y9 / nnmnkwii_gallery
View on GitHub
A collection of examples demonstrating how we can build speech synthesis systems using nnmnkwii.
☆70May 15, 2020Updated 6 years ago
lmxue / ICASSP2022_TTS_VC_Summary
View on GitHub
ICASSP2022 TTS&VC Summary
☆13Jun 9, 2022Updated 4 years ago
hcy71o / AutoVocoder
View on GitHub
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
☆71Dec 2, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
vivianngo97 / Punctuation_Transcription
View on GitHub
A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.
☆15Aug 6, 2020Updated 5 years ago
t13m / kaldi-readers-for-tensorflow
View on GitHub
readers that enable reading kaldi ark in tensorflow
☆17Mar 7, 2018Updated 8 years ago
npuichigo / tarzan
View on GitHub
High-level API for tar-based dataset
☆12Feb 3, 2024Updated 2 years ago
stanford-oval / genie-parser
View on GitHub
Neural Network Semantic Parser for Almond
☆15Apr 11, 2019Updated 7 years ago
ksw0306 / WaveVAE
View on GitHub
A Pytorch implementation of WaveVAE ("Parallel Neural Text-to-Speech")
☆127Feb 24, 2024Updated 2 years ago
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
ffont / ismir2016
View on GitHub
Instructions for reproducing the research described in the paper "Tempo Estimation for Music Loops and a Simple Confidence Measure"
☆14Nov 18, 2016Updated 9 years ago
kan-bayashi / INTERSPEECH19_TUTORIAL
View on GitHub
Interspeech 2019 tutorial materials
☆49Sep 26, 2019Updated 6 years ago
xcmyz / Transformer-TTS
View on GitHub
TTS model based on Transformer.
☆57Aug 2, 2019Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
colaudiolab / DeepLearning4UTI
View on GitHub
Deep Learning For Ultrasound Tongue Imaging
☆13Dec 17, 2024Updated last year
rguthrie3 / DeepDependencyParsingProblemSet
View on GitHub
A step-by-step problem set for implementing a high-quality deep dependency parser in Pytorch
☆15Aug 12, 2017Updated 8 years ago
hcy71o / MB-iSTFT-VITS-with-AutoVocoder
View on GitHub
Incorporating AutoVocoder to MB-iSTFT-VITS
☆47Dec 1, 2022Updated 3 years ago
kokeshing / WaveNet-Estimator
View on GitHub
WaveNet implementation using tf.estimator
☆21Jul 6, 2023Updated 3 years ago
witko0 / kaldifordummies
View on GitHub
Simple automatic speech recognition system based on digits corpora (Polish language), created in Kaldi toolkit. Despite of the language d…
☆11May 29, 2016Updated 10 years ago
nii-yamagishilab / TSNetVocoder
View on GitHub
☆42Oct 30, 2018Updated 7 years ago
rbarghou / pygriffinlim
View on GitHub
A python implementation of the Griffin Lim Algorithm for audio reconstruction from magnitudes
☆36Jan 17, 2024Updated 2 years ago
dipjyoti92 / SC-WaveRNN
View on GitHub
Official PyTorch implementation of Speaker Conditional WaveRNN
☆110Jun 22, 2022Updated 4 years ago
shuiliwanwu / ConvLstm-ultrasound-videos
View on GitHub
PREDICTING TONGUE MOTION IN UNLABELED ULTRASOUND VIDEOS USING CONVOLUTIONAL LSTM NEURAL NETWORKS
☆19Oct 29, 2018Updated 7 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
MiniXC / phones
View on GitHub
A collection of utilities for handling IPA phones.
☆27Sep 24, 2023Updated 2 years ago
revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
ttaoREtw / semi-tts
View on GitHub
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
☆39Jul 16, 2020Updated 6 years ago
yanggeng1995 / vae_tacotron
View on GitHub
☆51Feb 15, 2019Updated 7 years ago
gerazov / prosodeep
View on GitHub
Deep understanding and modelling of the hierarchical structure of prosody
☆25May 12, 2019Updated 7 years ago
r9y9 / kiritan_singing
View on GitHub
Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.
☆28Dec 31, 2023Updated 2 years ago
dayihengliu / Mu-Forcing-VRAE
View on GitHub
Code for TALLIP2019 paper "µ-Forcing: Training Variational Recurrent Autoencoders for Text Generation"
☆12May 27, 2019Updated 7 years ago