stanford-policylab / asr-disparities
Code and data for Koenecke et al. (2020)
☆28Updated last year
Related projects ⓘ
Alternatives and complementary repositories for asr-disparities
- COre Variable Feature Extraction Feature Extractor☆30Updated last year
- fiwGAN/ciwGAN (Featural and Categorical InfoWaveGAN): Generative Adversarial Phonology and Semantics☆23Updated last year
- A guide to building language technology in new languages.☆57Updated 2 years ago
- Gamma Agreement in Python☆43Updated 8 months ago
- Dr.VOT is an a software package for automatic measurement of voice onset time (VOT).☆27Updated last year
- Read, write, and manipulate Praat TextGrid files with Python☆126Updated 11 months ago
- Second SIGMORPHON Shared Task on Grapheme-to-Phoneme Conversions☆22Updated 3 years ago
- A neural language model that estimates incremental processing complexity☆39Updated 3 years ago
- ☆15Updated 6 years ago
- Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016)☆14Updated 7 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆13Updated 4 years ago
- Python module for syllabifying English ARPABET transcriptions☆64Updated 5 years ago
- ☆22Updated 2 years ago
- Code for "Planning and Generating Natural and Diverse Disfluent Texts as Augmentation for Disfluency Detection"☆15Updated 2 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆35Updated 3 years ago
- All you need to get started for the Zero Speech Challenge 2017☆46Updated 5 years ago
- Speech2vec pre-trained word vectors☆77Updated 6 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆37Updated last year
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆10Updated 4 years ago
- Phonetically-Oriented Word Error Rate☆33Updated 5 years ago
- Code for AccentDB.☆19Updated 3 years ago
- A list of publicly available data sets from psycholinguistic studies☆31Updated 8 years ago
- Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.☆11Updated 3 years ago
- A repository containing links to useful phonological software☆11Updated last year
- Educational tutorials for speech and language processing classes☆12Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Novoic's linguistic feature extraction library☆35Updated 2 years ago
- ☆22Updated last year
- The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S.☆42Updated 3 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated last year