ZackHodari/average_prosody

Code for the paper "Using generative modelling to produce varied intonation for speech synthesis", submitted to the Speech Synthesis Workshop

☆24

Alternatives and similar repositories for average_prosody

Users that are interested in average_prosody are comparing it to the libraries listed below.

ZackHodari / discrete_intonation
Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…
☆17 · May 24, 2020 · Updated 6 years ago
ZackHodari / tts_data_tools
Data processing tools for preparing speech and labels for training TTS voices
☆29 · Aug 13, 2020 · Updated 5 years ago
rhoposit / icassp2021
☆15 · May 8, 2021 · Updated 5 years ago
Yangyangii / AdvDCTTS
Implementation of DCTTS with Adversarial Training
☆12 · Dec 30, 2019 · Updated 6 years ago
espnet / icassp2020-tts
ESPnet-TTS Audio Sample HP
☆21 · Oct 25, 2019 · Updated 6 years ago
Yangyangii / TPGST-Tacotron
Google's TPGST reimplementation.
☆34 · Dec 11, 2019 · Updated 6 years ago
nii-yamagishilab / self-attention-tacotron
An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" …
☆114 · Jun 19, 2020 · Updated 6 years ago
huiw39 / ExtensibleTTS-PyTorch
An extensible speech synthesis system built with PyTorch; the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery
☆26 · Jun 24, 2019 · Updated 7 years ago
ljuvela / multiscale-GAN
Code for ICASSP 2019 paper
☆18 · Oct 29, 2018 · Updated 7 years ago
ajinkyakulkarni14 / ERISHA
ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44 · Dec 17, 2020 · Updated 5 years ago
yanggeng1995 / vae_tacotron
☆51 · Feb 15, 2019 · Updated 7 years ago
WelkinYang / EMPHASIS-pytorch
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
☆15 · Mar 31, 2019 · Updated 7 years ago
r9y9 / icassp2020-espnet-tts-merlin-baseline
ICASSP 2020 ESPnet-TTS: Merlin baseline system
☆37 · Oct 28, 2019 · Updated 6 years ago
jaywalnut310 / Vector-Quantized-Autoencoders
TensorFlow implementation of "Theory and Experiments on Vector Quantized Autoencoders"
☆15 · Feb 27, 2019 · Updated 7 years ago
cnlinxi / tpse_tacotron2
TPSE-GST Tacotron2
☆14 · May 1, 2019 · Updated 7 years ago
kastnerkyle / diphone_synthesizer
A tutorial diphone synthesizer in Python
☆26 · Nov 26, 2018 · Updated 7 years ago
candlewill / RawNet
RawNet: Fast End-to-End Neural Vocoder
☆43 · May 29, 2019 · Updated 7 years ago
bajibabu / postfilt_gan
An implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"
☆16 · Jun 27, 2018 · Updated 8 years ago
rishikksh20 / vae_tacotron2
VAE Tacotron 2, an alternative to GST Tacotron
☆91 · Jul 6, 2023 · Updated 3 years ago
stanford-oval / genie-parser
Neural Network Semantic Parser for Almond
☆15 · Apr 11, 2019 · Updated 7 years ago
vara-tts / VARA-TTS
Demo audio of VARA-TTS model
☆20 · Jun 11, 2021 · Updated 5 years ago
xcmyz / Lifelong-Learning-Tacotron2
MultiSpeaker Tacotron2 using LifeLong Learning.
☆13 · Sep 27, 2019 · Updated 6 years ago
ChangLabUcsf / intonatang
Analysis code for speech intonation project
☆14 · Feb 19, 2019 · Updated 7 years ago
zhitko / inton-trainer
Inton Trainer is designed for learning the intonation of oral speech.
☆13 · Jun 27, 2026 · Updated 3 weeks ago
colaudiolab / DeepLearning4UTI
Deep Learning For Ultrasound Tongue Imaging
☆13 · Dec 17, 2024 · Updated last year
rguthrie3 / DeepDependencyParsingProblemSet
A step-by-step problem set for implementing a high-quality deep dependency parser in PyTorch
☆15 · Aug 12, 2017 · Updated 8 years ago
xcmyz / Transformer-TTS
TTS model based on Transformer.
☆57 · Aug 2, 2019 · Updated 6 years ago
choiHkk / Transformer-TTS-V2
☆25 · Mar 6, 2024 · Updated 2 years ago
kastnerkyle / representation_mixing
Demos, pretrained models, and (WIP) code supporting Representation Mixing
☆51 · Dec 18, 2018 · Updated 7 years ago
ErikEkstedt / conv_ssl
☆14 · Feb 9, 2023 · Updated 3 years ago
witko0 / kaldifordummies
Simple automatic speech recognition system based on a digits corpus (Polish language), created with the Kaldi toolkit. Despite the language d…
☆11 · May 29, 2016 · Updated 10 years ago
andi611 / CS-Tacotron-Pytorch
PyTorch implementation of CS-Tacotron, a code-switching end-to-end generative TTS model.
☆23 · Mar 14, 2019 · Updated 7 years ago
shreyas253 / SylNet
SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech
☆27 · May 25, 2023 · Updated 3 years ago
shuiliwanwu / ConvLstm-ultrasound-videos
Predicting Tongue Motion in Unlabeled Ultrasound Videos Using Convolutional LSTM Neural Networks
☆19 · Oct 29, 2018 · Updated 7 years ago
gerazov / PySFC
Python implementation of the SFC intonation model.
☆18 · Nov 29, 2017 · Updated 8 years ago
r9y9 / nnmnkwii_gallery
A collection of examples demonstrating how we can build speech synthesis systems using nnmnkwii.
☆70 · May 15, 2020 · Updated 6 years ago
paarthneekhara / advoc
Vocode spectrograms to audio with generative adversarial networks
☆64 · Aug 8, 2019 · Updated 6 years ago
gerazov / prosodeep
Deep understanding and modelling of the hierarchical structure of prosody
☆25 · May 12, 2019 · Updated 7 years ago
zerospeech / zerospeech2021_baseline
BERT and LSTM baseline models of the ZeroSpeech Challenge 2021
☆60 · Oct 19, 2022 · Updated 3 years ago