A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting features from and making manipulations on audio files given hierarchical time-aligned transcriptions (utterance > word > syllable > phone, etc).
☆344Jan 18, 2026Updated last month
Alternatives and similar repositories for praatIO
Users that are interested in praatIO are comparing it to the libraries listed below
Sorting:
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆84Jan 18, 2026Updated last month
- Praat in Python, the Pythonic way☆1,239Mar 2, 2026Updated 2 weeks ago
- Charsiu: A neural phonetic aligner.☆334Sep 19, 2022Updated 3 years ago
- A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat☆298Nov 8, 2023Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆83Nov 13, 2021Updated 4 years ago
- Praat: Doing Phonetics By Computer☆1,862Updated this week
- Command line utility for forced alignment using Kaldi☆1,762Feb 24, 2026Updated 3 weeks ago
- A python wrapper for Speech Signal Processing Toolkit (SPTK).☆450Jul 16, 2024Updated last year
- Read, write, and manipulate Praat TextGrid files with Python☆130Feb 25, 2026Updated 2 weeks ago
- Trainable algorithm for automatic measurement of voice onset time☆68Jul 26, 2023Updated 2 years ago
- feature extraction from speech signals☆395Jun 15, 2025Updated 9 months ago
- A differentiable version of SPTK☆195Feb 26, 2026Updated 2 weeks ago
- A Python wrapper for the high-quality vocoder "World"☆781Jan 21, 2025Updated last year
- Python interface for forced audio alignment using HTK and SoX☆350Jun 28, 2020Updated 5 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆368Oct 12, 2021Updated 4 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆346May 15, 2024Updated last year
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆135Nov 4, 2021Updated 4 years ago
- Praat textgrid manipulation in Python☆55Apr 3, 2025Updated 11 months ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)☆13Oct 8, 2020Updated 5 years ago
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- Simple text to phones converter for multiple languages☆1,517Sep 26, 2024Updated last year
- Mel-Generalized Cepstrum analysis☆20Jul 21, 2017Updated 8 years ago
- A library for speech data augmentation in time-domain☆683Aug 30, 2021Updated 4 years ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆89Dec 20, 2024Updated last year
- Python library for handling audio datasets.☆138Jul 6, 2023Updated 2 years ago
- multilingual speech aligner☆76Nov 19, 2023Updated 2 years ago
- Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner☆44Jun 22, 2021Updated 4 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆339Jul 6, 2023Updated 2 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Apr 25, 2025Updated 10 months ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆379Jul 21, 2024Updated last year
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion☆104Mar 10, 2026Updated last week
- Mietta's Scripts are now on github! https://github.com/lennes/spect☆58Dec 14, 2021Updated 4 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆29Aug 13, 2020Updated 5 years ago
- rPraat package for R☆30Dec 9, 2021Updated 4 years ago
- A suite of speech signal processing tools☆243Feb 3, 2026Updated last month
- Zero-Shot Emotion Style Transfer☆49Apr 23, 2025Updated 10 months ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages☆713Apr 26, 2024Updated last year
- Easier analysis of large speech corpora☆23Jun 22, 2021Updated 4 years ago