cldf/segments

Unicode Standard tokenization routines and orthography profile segmentation

☆41

Alternatives and similar repositories for segments

Users who are interested in segments are comparing it to the libraries listed below.

Connum / npm-pinyin2ipa
Converts Mandarin Chinese pinyin notation to IPA (international phonetic alphabet) notation
☆19 · Nov 28, 2023 · Updated 2 years ago
rossellhayes / ipa
🗣️ Convert between phonetic alphabets
☆11 · Feb 7, 2022 · Updated 4 years ago
leiradel / luamods
Collection of small Lua modules
☆10 · Feb 15, 2026 · Updated 5 months ago
jacobkrantz / lstm-syllabify
Breaks a word into syllables using an LSTM-based neural network.
☆20 · Aug 14, 2023 · Updated 2 years ago
breckinloggins / vau
A programming language
☆14 · Jan 24, 2015 · Updated 11 years ago
vlasakm / mmtex
A minimal modern (Lua)TeX distribution
☆15 · May 12, 2024 · Updated 2 years ago
bwoods / TeX--
A TeX implementation in a single C++11 class.
☆20 · Sep 19, 2020 · Updated 5 years ago
facebookresearch / llama-hd-dataset
This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.
☆22 · Jan 22, 2024 · Updated 2 years ago
uiuc-sst / asr24
24-hour Automatic Speech Recognition
☆27 · Jun 4, 2021 · Updated 5 years ago
cldf-clts / clts
Cross-Linguistic Transcription Systems
☆17 · Mar 20, 2026 · Updated 4 months ago
apertium / lexd
A lexicon compiler for non-suffixational morphologies
☆15 · Jan 29, 2026 · Updated 5 months ago
p1an-lin-jung / wv_tts
☆19 · Mar 22, 2024 · Updated 2 years ago
gliese1337 / schrodinger-lisp
☆21 · May 12, 2012 · Updated 14 years ago
cldf / pycldf
python package to read and write CLDF datasets
☆21 · Updated this week
pldn / LDWizard
🧙 LDWizard: A generic framework for simplifying the creation of linked data. Supported by the PLDN community.
☆18 · May 27, 2024 · Updated 2 years ago
CUNY-CL / wikipron
Massively multilingual pronunciation mining
☆370 · Jul 13, 2026 · Updated last week
starwing / lbuffer
a mutable string support to lua.
☆26 · Mar 20, 2015 · Updated 11 years ago
xinjli / ucla-phonetic-corpus
Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION
☆46 · May 12, 2023 · Updated 3 years ago
rhdunn / cmudict-tools
Tools for working with the CMU Pronunciation Dictionary
☆36 · Sep 5, 2017 · Updated 8 years ago
zauguin / luametalatex
☆20 · Jul 16, 2023 · Updated 3 years ago
stefanocoretta / speakr
speakr: A Wrapper for the Phonetic Software Praat
☆27 · Feb 28, 2026 · Updated 4 months ago
rhasspy / gruut-ipa
Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)
☆106 · Nov 20, 2023 · Updated 2 years ago
BrownCLPS / LingView
A web interface for viewing ELAN and FLEx files:
☆19 · Feb 16, 2024 · Updated 2 years ago
bajibabu / GlottGAN
This repository contains the files used for our Interspeech 2017 paper.
☆16 · May 30, 2017 · Updated 9 years ago
Caucasus-Rosetta / Lingua-Corpus
Caucasus languages focused multilingual and monolingual corpuses for Natural Language Processing(NLP)
☆37 · Updated this week
brentp / lua-stringy
fast lua string operations
☆22 · Mar 21, 2020 · Updated 6 years ago
xigt / xigt
eXtensible Interlinear Glossed Text
☆34 · May 16, 2022 · Updated 4 years ago
yazone / g2pE_mobile
g2p for english tts
☆19 · Nov 10, 2022 · Updated 3 years ago
lwang114 / UnsupTTS
☆37 · Mar 26, 2024 · Updated 2 years ago
flashlight / sequence
Sequence algorithms for use in Flashlight.
☆14 · Jan 12, 2026 · Updated 6 months ago
felixkreuk / SegFeat
Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)
☆83 · Nov 13, 2021 · Updated 4 years ago
nrnrnr / build-prove-compare-student-code
Student-facing code from the book *Programming Languages: Build, Prove, and Compare* by Norman Ramsey
☆35 · May 18, 2023 · Updated 3 years ago
lingjzhu / charsiu
Charsiu: A neural phonetic aligner.
☆346 · Sep 19, 2022 · Updated 3 years ago
t13m / kaldi-readers-for-tensorflow
readers that enable reading kaldi ark in tensorflow
☆17 · Mar 7, 2018 · Updated 8 years ago
unza-speech-lab / zambezi-voice
Repository for multilingual speech data resources for native languages of Zambia.
☆22 · Oct 9, 2024 · Updated last year
cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15 · Dec 3, 2024 · Updated last year
viking-sudo-rm / industrial-stacknns
Stack neural networks applied to hefty natural language tasks.
☆15 · Dec 26, 2019 · Updated 6 years ago
wharris / libesm
C library for efficient string matching with Aho-Corasick
☆21 · Jan 20, 2012 · Updated 14 years ago
ex3ndr / supervoice-gpt-facodec
GPT for FACodec
☆13 · Mar 25, 2024 · Updated 2 years ago