Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.
☆45Nov 13, 2019Updated 6 years ago
Alternatives and similar repositories for Blizzard2013_Segmentation
Users that are interested in Blizzard2013_Segmentation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆122Jan 24, 2023Updated 3 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆40Jul 17, 2021Updated 4 years ago
- Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer☆38Feb 17, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Nov 18, 2021Updated 4 years ago
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".☆26Nov 4, 2023Updated 2 years ago
- The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems☆284Oct 10, 2023Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Jul 31, 2023Updated 2 years ago
- Unofficial implementation of NANSY++ in Pytorch Lightning☆50Mar 11, 2024Updated 2 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Apr 1, 2021Updated 5 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech☆252Feb 9, 2022Updated 4 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆65May 30, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo☆19Oct 8, 2020Updated 5 years ago
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆160Jun 13, 2024Updated last year
- Official implementation of SpeechSplit2☆136Oct 22, 2022Updated 3 years ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆141Apr 27, 2024Updated last year
- ☆31Jul 13, 2023Updated 2 years ago
- This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).☆14Jun 15, 2021Updated 4 years ago
- This repository contains laughter-related synthesis systems.☆13Nov 7, 2020Updated 5 years ago
- Official implementation of MelHuBERT☆70Feb 21, 2026Updated last month
- pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020☆30Jul 6, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5☆19Nov 29, 2022Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆56Nov 16, 2025Updated 5 months ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Sep 1, 2020Updated 5 years ago
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis☆115Dec 2, 2020Updated 5 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆133Nov 29, 2023Updated 2 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15May 30, 2021Updated 4 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This is the GitHub page for publicly available emotional speech data.☆385Jan 6, 2022Updated 4 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Mar 17, 2023Updated 3 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37May 25, 2021Updated 4 years ago
- Official implementation of BVAE-TTS☆173Sep 26, 2022Updated 3 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆329Sep 24, 2022Updated 3 years ago
- ☆260May 15, 2023Updated 2 years ago
- TTS for pitch-accented language. Korean dialect DB.☆156May 12, 2023Updated 2 years ago