Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Workshop
☆24Dec 8, 2019Updated 6 years ago
Alternatives and similar repositories for average_prosody
Users that are interested in average_prosody are comparing it to the libraries listed below
Sorting:
- Data processing tools for preparing speech and labels for training TTS voices☆29Aug 13, 2020Updated 5 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Jun 24, 2019Updated 6 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Mar 31, 2019Updated 6 years ago
- ☆15May 8, 2021Updated 4 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…☆17May 24, 2020Updated 5 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- Implementation of DCTTS with Adversarial Training☆12Dec 30, 2019Updated 6 years ago
- ☆51Feb 15, 2019Updated 7 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 5 months ago
- A tutorial diphone synthesizer in Python☆25Nov 26, 2018Updated 7 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- Simple automatic speech recognition system based on digits corpora (Polish language), created in Kaldi toolkit. Despite of the language d…☆11May 29, 2016Updated 9 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Jun 27, 2018Updated 7 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42May 29, 2019Updated 6 years ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Dec 18, 2018Updated 7 years ago
- TTS model based on Transformer.☆58Aug 2, 2019Updated 6 years ago
- Crowdsourced Audio Quality Evaluation Toolkit☆55Dec 7, 2022Updated 3 years ago
- Vocode spectrograms to audio with generative adversarial networks☆64Aug 8, 2019Updated 6 years ago
- TPSE-GST Tacotron2☆14May 1, 2019Updated 6 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Apr 25, 2025Updated 10 months ago
- Demo audio of VARA-TTS model☆20Jun 11, 2021Updated 4 years ago
- Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf☆32Jul 6, 2023Updated 2 years ago
- ☆37May 8, 2021Updated 4 years ago
- An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" …☆114Jun 19, 2020Updated 5 years ago
- Deep understanding and modelling of the hierarchical structure of prosody☆24May 12, 2019Updated 6 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Oct 19, 2022Updated 3 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Feb 2, 2023Updated 3 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆26Dec 12, 2024Updated last year
- VAE Tacotron 2, an alternative of GST Tacotron☆90Jul 6, 2023Updated 2 years ago
- ☆64Aug 14, 2023Updated 2 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Dec 17, 2017Updated 8 years ago
- Transfer learning exploration of dc_tts text-to-speech model☆21Mar 5, 2019Updated 6 years ago
- ☆25Jan 24, 2023Updated 3 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year