timmahrt / praatIOView external linksLinks
A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting features from and making manipulations on audio files given hierarchical time-aligned transcriptions (utterance > word > syllable > phone, etc).
☆345Jan 18, 2026Updated 3 weeks ago
Alternatives and similar repositories for praatIO
Users that are interested in praatIO are comparing it to the libraries listed below
Sorting:
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆84Jan 18, 2026Updated 3 weeks ago
- Praat in Python, the Pythonic way☆1,230Jan 23, 2026Updated 3 weeks ago
- Charsiu: A neural phonetic aligner.☆329Sep 19, 2022Updated 3 years ago
- A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat☆297Nov 8, 2023Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆83Nov 13, 2021Updated 4 years ago
- Praat: Doing Phonetics By Computer☆1,842Updated this week
- Command line utility for forced alignment using Kaldi☆1,746Feb 2, 2026Updated last week
- A python wrapper for Speech Signal Processing Toolkit (SPTK).☆450Jul 16, 2024Updated last year
- Read, write, and manipulate Praat TextGrid files with Python☆129Dec 18, 2023Updated 2 years ago
- Trainable algorithm for automatic measurement of voice onset time☆67Jul 26, 2023Updated 2 years ago
- feature extraction from speech signals☆390Jun 15, 2025Updated 7 months ago
- A differentiable version of SPTK☆192Feb 3, 2026Updated last week
- A Python wrapper for the high-quality vocoder "World"☆778Jan 21, 2025Updated last year
- Python interface for forced audio alignment using HTK and SoX☆350Jun 28, 2020Updated 5 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆367Oct 12, 2021Updated 4 years ago
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆134Nov 4, 2021Updated 4 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆345May 15, 2024Updated last year
- Praat textgrid manipulation in Python☆54Apr 3, 2025Updated 10 months ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)☆13Oct 8, 2020Updated 5 years ago
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- Simple text to phones converter for multiple languages☆1,511Sep 26, 2024Updated last year
- Mel-Generalized Cepstrum analysis☆20Jul 21, 2017Updated 8 years ago
- A library for speech data augmentation in time-domain☆682Aug 30, 2021Updated 4 years ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆86Dec 20, 2024Updated last year
- Python library for handling audio datasets.☆138Jul 6, 2023Updated 2 years ago
- multilingual speech aligner☆76Nov 19, 2023Updated 2 years ago
- Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner☆44Jun 22, 2021Updated 4 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆340Jul 6, 2023Updated 2 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Apr 25, 2025Updated 9 months ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆379Jul 21, 2024Updated last year
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion☆104Jul 12, 2023Updated 2 years ago
- Mietta's Scripts are now on github! https://github.com/lennes/spect☆58Dec 14, 2021Updated 4 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆29Aug 13, 2020Updated 5 years ago
- Articulatory (text-to-) speech synthesis for Python☆27May 7, 2025Updated 9 months ago
- rPraat package for R☆30Dec 9, 2021Updated 4 years ago
- A suite of speech signal processing tools☆243Feb 3, 2026Updated last week
- Zero-Shot Emotion Style Transfer☆49Apr 23, 2025Updated 9 months ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages☆705Apr 26, 2024Updated last year