cmu-llab / wikihan
☆11Updated last year
Related projects: ⓘ
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆78Updated 10 months ago
- ☆26Updated 3 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆127Updated this week
- Read, write, and manipulate Praat TextGrid files with Python☆123Updated 9 months ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆213Updated last month
- Universal multilingual automatic speech transcription into IPA☆51Updated 3 weeks ago
- Praat textgrid manipulation in Python☆51Updated 7 months ago
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆81Updated 4 months ago
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆29Updated 7 months ago
- A guide to building language technology in new languages.☆57Updated 2 years ago
- Workflow for forced alignment between languages☆17Updated 7 months ago
- https://arxiv.org/pdf/2402.18025☆17Updated 3 weeks ago
- Keyword spotting and forced alignment in any language☆31Updated 2 months ago
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…☆43Updated 2 years ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆39Updated 6 months ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆139Updated last year
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆37Updated last year
- Python module for syllabifying English ARPABET transcriptions☆63Updated 5 years ago
- Acoustic distance measure for comparing pronunciations☆14Updated 2 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆34Updated last year
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆34Updated 3 years ago
- Generating artificial disfluencies from fluent text easily and promptly☆10Updated last year
- A phoneme-allophone database for many languages☆47Updated 4 years ago
- Second SIGMORPHON Shared Task on Grapheme-to-Phoneme Conversions☆22Updated 3 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆20Updated this week
- Read-only mirror of Pynini☆121Updated 2 months ago
- simple textgrid to csv converter☆25Updated 3 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆128Updated 5 months ago
- A Toolkit for ToBI Labeling with Python Data Structures☆24Updated 2 years ago
- Cross-Linguistic Transcription Systems☆14Updated 5 months ago
- Dr.VOT is an a software package for automatic measurement of voice onset time (VOT).☆26Updated last year