xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

☆737

Alternatives and similar repositories for allosaurus

Users who are interested in allosaurus are comparing it to the libraries listed below.

xinjli / transphone
phoneme tokenizer and grapheme-to-phoneme model for 8k languages
☆174 · Jun 9, 2023 · Updated 3 years ago
dmort27 / allovera
A phoneme-allophone database for many languages
☆54 · May 19, 2020 · Updated 6 years ago
xinjli / ucla-phonetic-corpus
Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION
☆46 · May 12, 2023 · Updated 3 years ago
dmort27 / panphon
Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.
☆319 · Oct 22, 2025 · Updated 9 months ago
dmort27 / epitran
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
☆827 · Jun 18, 2026 · Updated last month
bootphon / phonemizer
Simple text to phones converter for multiple languages
☆1,558 · Sep 26, 2024 · Updated last year
CUNY-CL / wikipron
Massively multilingual pronunciation mining
☆371 · Updated this week
lingjzhu / charsiu
Charsiu: A neural phonetic aligner.
☆347 · Sep 19, 2022 · Updated 3 years ago
xinjli / alqalign
multilingual speech aligner
☆78 · Nov 19, 2023 · Updated 2 years ago
lumaku / ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
☆348 · May 15, 2024 · Updated 2 years ago
xinjli / phonepiece
phone inventory library
☆17 · May 15, 2023 · Updated 3 years ago
lingjzhu / CharsiuG2P
Multilingual G2P in 100 languages
☆390 · May 26, 2023 · Updated 3 years ago
axelspringer / DeepPhonemizer
Grapheme to phoneme conversion with deep learning.
☆432 · Dec 8, 2023 · Updated 2 years ago
coqui-ai / open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
☆1,397 · Jun 6, 2024 · Updated 2 years ago
s3prl / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,557 · Mar 12, 2026 · Updated 4 months ago
m-wiesner / nnet_pytorch
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26 · Jul 25, 2024 · Updated 2 years ago
Kyubyong / g2p
g2p: English Grapheme To Phoneme Conversion
☆926 · Jan 5, 2023 · Updated 3 years ago
AdolfVonKleist / Phonetisaurus
Phonetisaurus G2P
☆517 · Jun 1, 2024 · Updated 2 years ago
tarepan / VoiceConversionLab
Collect Voice Conversion researches
☆97 · Updated this week
festvox / datasets-CMU_Wilderness
CMU Wilderness Multilingual Speech Dataset
☆292 · Apr 20, 2019 · Updated 7 years ago
ASR-project / Multilingual-PR
Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…
☆266 · May 9, 2022 · Updated 4 years ago
xinjli / asr2k
asr2k
☆51 · Jun 2, 2024 · Updated 2 years ago
NRC-ILT / g2p
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
☆203 · Updated this week
facebookresearch / WavAugment
A library for speech data augmentation in time-domain
☆689 · Aug 30, 2021 · Updated 4 years ago
YannickJadoul / Parselmouth
Praat in Python, the Pythonic way
☆1,274 · Updated this week
phoible / dev
PHOIBLE data and development.
☆147 · Jul 8, 2026 · Updated 2 weeks ago
MontrealCorpusTools / Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
☆1,852 · Jul 11, 2026 · Updated 2 weeks ago
hash2430 / pitchtron
TTS for pitch-accented language. Korean dialect DB.
☆155 · May 12, 2023 · Updated 3 years ago
uniglot / korean-word-ipa-dictionary
Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)
☆23 · Nov 12, 2025 · Updated 8 months ago
lhotse-speech / lhotse
Tools for handling multimodal data in machine learning projects.
☆1,143 · Jun 22, 2026 · Updated last month
uiuc-sst / g2ps
Data and code for grapheme-to-phoneme transducers in lots of languages
☆152 · Apr 5, 2024 · Updated 2 years ago
lingjzhu / zipa
A family of efficient speech models for multilingual phone recognition
☆68 · Jul 18, 2026 · Updated last week
felixkreuk / UnsupSeg
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
☆146 · Aug 5, 2022 · Updated 3 years ago
auspicious3000 / SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
☆697 · Oct 23, 2024 · Updated last year
ynop / audiomate
Python library for handling audio datasets.
☆139 · Jul 6, 2023 · Updated 3 years ago
k2-fsa / k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
☆1,348 · Jul 11, 2026 · Updated 2 weeks ago
ivanvovk / WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
☆409 · Jul 7, 2021 · Updated 5 years ago
timmahrt / praatIO
A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …
☆349 · Jan 18, 2026 · Updated 6 months ago
asuni / wavelet_prosody_toolkit
☆200 · May 3, 2024 · Updated 2 years ago