alpoktem / ProsographLinks
A Visualizer for prosodically annotated speech corpora
☆12Updated 3 years ago
Alternatives and similar repositories for Prosograph
Users that are interested in Prosograph are comparing it to the libraries listed below
Sorting:
- Labeled data for homograph disambiguation☆59Updated 2 years ago
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- ☆40Updated 3 years ago
- asr2k☆52Updated last year
- A collection of utilities for handling IPA phones.☆25Updated last year
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆96Updated last year
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Updated 4 years ago
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Updated 2 years ago
- A phoneme-allophone database for many languages☆52Updated 5 years ago
- ☆17Updated 2 years ago
- Prosodic Speech Segmentation with Transformers☆26Updated last year
- ☆80Updated last month
- A repository for dictionaries to be used with the Prosodylab-Aligner☆17Updated 11 years ago
- ☆22Updated 4 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- ☆43Updated 3 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- ☆12Updated 2 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Updated 7 years ago
- ☆20Updated 6 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆31Updated 3 years ago
- Grapheme to phoneme model for PyTorch☆41Updated 3 years ago
- Text-to-Speech tutorial at SLTU 2016☆34Updated 9 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 6 years ago
- Python module for syllabifying English ARPABET transcriptions☆69Updated 6 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆24Updated 3 years ago