alpoktem / ProsographLinks
A Visualizer for prosodically annotated speech corpora
☆12Updated 3 years ago
Alternatives and similar repositories for Prosograph
Users that are interested in Prosograph are comparing it to the libraries listed below
Sorting:
- ☆40Updated 3 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 5 years ago
- Multilingual Grapheme to Phoneme☆49Updated 9 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Updated 3 years ago
- Labeled data for homograph disambiguation☆57Updated 2 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- ☆42Updated 2 years ago
- ☆80Updated last year
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 4 years ago
- ☆56Updated 2 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Updated 2 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆27Updated last year
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Updated 2 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆120Updated 2 years ago
- ☆31Updated last year
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Updated 4 years ago
- A Chinese version of A Neural Parametric Singing Synthesizer☆12Updated 3 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆46Updated 3 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- A repository for dictionaries to be used with the Prosodylab-Aligner☆17Updated 11 years ago
- Source code for INTERSPEECH2020☆11Updated 4 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- ☆42Updated 3 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆31Updated 2 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆23Updated 3 years ago
- asr2k☆50Updated last year
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago