Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
☆247Oct 30, 2019Updated 6 years ago
Alternatives and similar repositories for prosody
Users that are interested in prosody are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆198May 3, 2024Updated last year
- Encoder and Decoder and Attention Based Prosody Prediction☆68Jan 17, 2018Updated 8 years ago
- ☆111Mar 9, 2026Updated 2 weeks ago
- A pytroch implementation of the FB-MelGAN☆90May 26, 2020Updated 5 years ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆362Dec 24, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Demo of Mandarin/Chinese TTS frontend☆285Apr 18, 2022Updated 3 years ago
- 基于随机森林和条件随机场的中文韵律预测模型☆28Jul 25, 2024Updated last year
- VCTK multi-speaker tacotron for ICASSP 2020☆266Mar 29, 2022Updated 3 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Aug 17, 2020Updated 5 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Aug 12, 2020Updated 5 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆29Aug 13, 2020Updated 5 years ago
- Predict prosody labels for Chinese sentences.☆41Jul 7, 2022Updated 3 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- TTS-frontend with Bert and CRF/lstm (For Tacotron)☆53Jun 2, 2020Updated 5 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- The Implementation of FastSpeech based on pytorch.☆880Jul 6, 2023Updated 2 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Feb 2, 2023Updated 3 years ago
- Chinese text normalization for speech processing☆722Mar 18, 2023Updated 3 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Nov 29, 2020Updated 5 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆366Dec 6, 2018Updated 7 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A repository for benchmarking neural vocoders by their quality and speed.☆211May 30, 2025Updated 9 months ago
- A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS☆233Dec 27, 2019Updated 6 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆248Mar 24, 2023Updated 3 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆90Jul 6, 2023Updated 2 years ago
- Command line utility for forced alignment using Kaldi☆1,774Feb 24, 2026Updated last month
- A Pytorch implementation of WaveVAE ("Parallel Neural Text-to-Speech")☆126Feb 24, 2024Updated 2 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech☆127Jul 16, 2020Updated 5 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Jun 22, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆866Jul 22, 2023Updated 2 years ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Dec 18, 2018Updated 7 years ago
- A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf☆372Nov 5, 2021Updated 4 years ago
- ☆262Dec 8, 2022Updated 3 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆116Dec 22, 2021Updated 4 years ago
- A WaveRNN implementation☆201Oct 14, 2019Updated 6 years ago
- Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.☆296Mar 15, 2025Updated last year