Trained deep neural network models for estimating articulatory keypoints from midsagittal ultrasound tongue videos and front-view lip camera videos using DeepLabCut. The underlying research is by Wrench, A. and Balch-Tomes, J. (2022): https://www.mdpi.com/1424-8220/22/3/1133 (https://doi.org/10.3390/s22031133).
☆24 · Jun 13, 2023 · Updated 2 years ago
Alternatives and similar repositories for DeepLabCut-for-Speech-Production
Users who are interested in DeepLabCut-for-Speech-Production often compare it with the repositories listed below.
- A fasttrack implementation in python ☆13 · Feb 10, 2026 · Updated 3 weeks ago
- Tools to process the UltraSuite data ☆13 · Nov 6, 2019 · Updated 6 years ago
- Wenet speech to text for react native ☆10 · Nov 1, 2022 · Updated 3 years ago
- A Free/Open-Source tool for manual annotation of Ultrasound Tongue Imaging data. ☆10 · Nov 30, 2025 · Updated 3 months ago
- Simple Kaldi recipe for forced alignment ☆11 · Jul 16, 2023 · Updated 2 years ago
- Code designed for analysis of tongue contour data - produces three metrics (Procrustes analysis, Modified Curvature Index and Fourier ana… ☆10 · Apr 19, 2024 · Updated last year
- A package of scripts for processing and analyzing ultrasound data for research in linguistics ☆12 · Feb 1, 2016 · Updated 10 years ago
- A free & open tool for transcribing audio interviews with offline ASR support ☆25 · Dec 21, 2023 · Updated 2 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994) ☆13 · Oct 8, 2020 · Updated 5 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based … ☆16 · Sep 5, 2017 · Updated 8 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core ☆15 · Jun 19, 2023 · Updated 2 years ago
- ☆30 · May 3, 2023 · Updated 2 years ago
- Make Praat Picture style plots of acoustic data ☆37 · Feb 4, 2026 · Updated 3 weeks ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa… ☆17 · May 15, 2015 · Updated 10 years ago
- Matlab tool for interactively extracting tongue contours from Ultrasound movie or DICOM sequences ☆17 · Apr 30, 2021 · Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and quickly. ☆16 · Jun 17, 2022 · Updated 3 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch ☆22 · Jan 18, 2023 · Updated 3 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi. ☆18 · Dec 21, 2021 · Updated 4 years ago
- Pronounce Arabic words ☆19 · May 27, 2019 · Updated 6 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github… ☆14 · Dec 19, 2022 · Updated 3 years ago
- Phonetic Analysis ToolKIT - PATKIT - Python package for analysing phonetic data ☆11 · Feb 24, 2026 · Updated last week
- ☆12 · Feb 11, 2026 · Updated 2 weeks ago
- wrassp is a wrapper for R around Michel Scheffers's libassp (Advanced Speech Signal Processor). The libassp library aims at providing fun… ☆27 · Dec 19, 2025 · Updated 2 months ago
- ☆33 · Nov 27, 2021 · Updated 4 years ago
- Expected edit distance implementation using OpenFst tools ☆11 · May 13, 2015 · Updated 10 years ago
- A C++ library for parsing and manipulating JSGF grammar files. ☆14 · Feb 13, 2024 · Updated 2 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE ☆15 · Nov 30, 2022 · Updated 3 years ago
- Unsupervised speech activity detection system. ☆11 · Jul 2, 2018 · Updated 7 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACL ☆10 · Aug 11, 2016 · Updated 9 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד) ☆11 · May 4, 2022 · Updated 3 years ago
- ☆10 · Mar 20, 2021 · Updated 4 years ago
- Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15 ☆12 · Apr 17, 2017 · Updated 8 years ago
- VoxAngeles Corpus ☆13 · Aug 23, 2025 · Updated 6 months ago
- Automatic LInguistic Unit Count Estimator (ALICE) ☆49 · Jan 27, 2025 · Updated last year
- Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language ☆43 · Feb 28, 2018 · Updated 8 years ago
- TTS Android demo of PaddleSpeech, merged into https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/demos ☆28 · Nov 30, 2022 · Updated 3 years ago
- A cross platform (Android/iOS/MacOS) Bahasa Indonesia speech recognizer library, written in Flutter. ☆12 · Nov 18, 2025 · Updated 3 months ago
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus ☆13 · Jul 25, 2022 · Updated 3 years ago
- Automatically exported from code.google.com/p/transducersaurus ☆11 · Apr 1, 2015 · Updated 10 years ago