A Python library for working with Praat, TextGrids, time-aligned audio transcripts, and audio files. It is primarily used for extracting features from, and making manipulations on, audio files given hierarchical time-aligned transcriptions (utterance > word > syllable > phone, etc.).
☆344 · Jan 18, 2026 · Updated last month
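
To make "hierarchical time-aligned transcriptions" concrete, here is a minimal standard-library sketch of two tiers (words and phones) where each phone is assigned to the word whose time span contains it. This is not praatIO's API: the `Interval` class, the `children_of` helper, and the sample timings are all hypothetical and written only for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Interval:
    """One labeled stretch of audio (hypothetical type, not praatIO's)."""
    start: float  # seconds
    end: float    # seconds
    label: str


def children_of(parent: Interval, tier: List[Interval]) -> List[Interval]:
    """Return the intervals of a lower tier that lie inside the parent's span."""
    return [iv for iv in tier if iv.start >= parent.start and iv.end <= parent.end]


# Hand-written sample tiers (assumed data, not from a real recording).
words = [Interval(0.0, 0.4, "hello"), Interval(0.4, 0.9, "world")]
phones = [
    Interval(0.0, 0.1, "h"), Interval(0.1, 0.2, "ə"),
    Interval(0.2, 0.3, "l"), Interval(0.3, 0.4, "oʊ"),
    Interval(0.4, 0.5, "w"), Interval(0.5, 0.7, "ɝ"),
    Interval(0.7, 0.8, "l"), Interval(0.8, 0.9, "d"),
]

# Map each word to the labels of the phones it contains (word > phone).
word_to_phones = {
    w.label: [p.label for p in children_of(w, phones)] for w in words
}

for word, phone_labels in word_to_phones.items():
    print(word, phone_labels)
```

The same containment test extends to deeper hierarchies (utterance > word > syllable > phone) by applying it tier by tier; with real data the intervals would come from a TextGrid file rather than being typed by hand.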
Alternatives and similar repositories for praatIO
Users interested in praatIO are comparing it to the libraries listed below.
- Prosody Morph: A Python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech. ☆84 · Jan 18, 2026 · Updated last month
- Praat in Python, the Pythonic way ☆1,238 · Mar 2, 2026 · Updated last week
- Charsiu: A neural phonetic aligner. ☆332 · Sep 19, 2022 · Updated 3 years ago
- A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat ☆298 · Nov 8, 2023 · Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020) ☆83 · Nov 13, 2021 · Updated 4 years ago
- Praat: Doing Phonetics By Computer ☆1,851 · Mar 1, 2026 · Updated last week
- Command line utility for forced alignment using Kaldi ☆1,757 · Feb 24, 2026 · Updated last week
- A python wrapper for Speech Signal Processing Toolkit (SPTK). ☆450 · Jul 16, 2024 · Updated last year
- Read, write, and manipulate Praat TextGrid files with Python ☆129 · Feb 25, 2026 · Updated last week
- Trainable algorithm for automatic measurement of voice onset time ☆68 · Jul 26, 2023 · Updated 2 years ago
- feature extraction from speech signals ☆395 · Jun 15, 2025 · Updated 8 months ago
- A differentiable version of SPTK ☆193 · Feb 26, 2026 · Updated last week
- A Python wrapper for the high-quality vocoder "World" ☆779 · Jan 21, 2025 · Updated last year
- Python interface for forced audio alignment using HTK and SoX ☆350 · Jun 28, 2020 · Updated 5 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion. ☆368 · Oct 12, 2021 · Updated 4 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆345 · May 15, 2024 · Updated last year
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder ☆135 · Nov 4, 2021 · Updated 4 years ago
- Praat textgrid manipulation in Python ☆54 · Apr 3, 2025 · Updated 11 months ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994) ☆13 · Oct 8, 2020 · Updated 5 years ago
- Yin pitch estimator in PyTorch ☆117 · Nov 7, 2022 · Updated 3 years ago
- Simple text to phones converter for multiple languages ☆1,515 · Sep 26, 2024 · Updated last year
- Mel-Generalized Cepstrum analysis ☆20 · Jul 21, 2017 · Updated 8 years ago
- A library for speech data augmentation in time-domain ☆683 · Aug 30, 2021 · Updated 4 years ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications ☆87 · Dec 20, 2024 · Updated last year
- Python library for handling audio datasets. ☆138 · Jul 6, 2023 · Updated 2 years ago
- multilingual speech aligner ☆76 · Nov 19, 2023 · Updated 2 years ago
- Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner ☆44 · Jun 22, 2021 · Updated 4 years ago
- ☆26 · Apr 21, 2021 · Updated 4 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion ☆339 · Jul 6, 2023 · Updated 2 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio… ☆36 · Apr 25, 2025 · Updated 10 months ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion" ☆379 · Jul 21, 2024 · Updated last year
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion ☆104 · Jul 12, 2023 · Updated 2 years ago
- Mietta's Scripts are now on github! https://github.com/lennes/spect ☆58 · Dec 14, 2021 · Updated 4 years ago
- rPraat package for R ☆30 · Dec 9, 2021 · Updated 4 years ago
- Data processing tools for preparing speech and labels for training TTS voices ☆29 · Aug 13, 2020 · Updated 5 years ago
- A suite of speech signal processing tools ☆243 · Feb 3, 2026 · Updated last month
- Zero-Shot Emotion Style Transfer ☆49 · Apr 23, 2025 · Updated 10 months ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages ☆709 · Apr 26, 2024 · Updated last year
- Easier analysis of large speech corpora ☆23 · Jun 22, 2021 · Updated 4 years ago