A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.
☆35 · May 5, 2023 · Updated 2 years ago
Alternatives and similar repositories for endangered-languages
Users who are interested in endangered-languages are comparing it to the repositories listed below
- List of research and engineering of NLP for American Native/Indigenous Languages. ☆93 · Nov 23, 2020 · Updated 5 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language ☆16 · Feb 4, 2026 · Updated 3 weeks ago
- Resources for conservation, development, and documentation of low resource (human) languages. ☆436 · Apr 2, 2025 · Updated 11 months ago
- Wiktionary parser tool for many language editions. ☆54 · Aug 17, 2022 · Updated 3 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Kalaallisut (Greenlandic) language ☆12 · Updated this week
- ☆45 · Jul 5, 2022 · Updated 3 years ago
- ☆10 · Mar 20, 2021 · Updated 4 years ago
- A Python library to add reconstructed pronunciations of Middle Chinese on Chinese texts ☆11 · Mar 13, 2023 · Updated 2 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד) ☆11 · May 4, 2022 · Updated 3 years ago
- Archived Python/Rust hybrid codebase - see divvun/kbdgen for v3 ☆26 · Feb 7, 2022 · Updated 4 years ago
- Study on lexibank data (presenting the lexibank dataset). ☆15 · Apr 11, 2025 · Updated 10 months ago
- ☆13 · Jun 9, 2020 · Updated 5 years ago
- R package for phonetic research and experimenting ☆20 · Jul 29, 2024 · Updated last year
- Overview of Icelandic NLP resources at a glance ☆18 · Jun 20, 2024 · Updated last year
- ☆19 · Oct 14, 2021 · Updated 4 years ago
- bilingual dictionary extractor from parallel corpora ☆23 · Jul 3, 2014 · Updated 11 years ago
- HORNMORPHO is a Python program that analyzes Amharic, Oromo, and Tigrinya words into their constituent morphemes (meaningful parts) and g… ☆20 · Jan 4, 2018 · Updated 8 years ago
- Random notes on Python internationalisation ☆19 · Aug 10, 2023 · Updated 2 years ago
- 🐍🍑 Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, … ☆21 · Jun 26, 2024 · Updated last year
- A JavaScript-based converter for transliterating Amharic text into Latin characters ☆19 · Feb 1, 2022 · Updated 4 years ago
- The Data Format for Digital Linguistics (DaFoDiL) ☆22 · Feb 7, 2023 · Updated 3 years ago
- Input a Chinese character and get all of its variant forms ☆21 · Apr 13, 2025 · Updated 10 months ago
- Phonological CorpusTools ☆121 · May 24, 2025 · Updated 9 months ago
- A multilingual parallel corpus created from translations of the Bible. ☆193 · May 19, 2025 · Updated 9 months ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning ☆35 · Updated this week
- A Praat plug-in for performing interactive phonetic forced alignment ☆29 · Sep 22, 2018 · Updated 7 years ago
- Code for morphological transformations ☆29 · Jun 3, 2017 · Updated 8 years ago
- Mother Tongues Dictionaries dictionary creation tool ☆15 · May 21, 2024 · Updated last year
- This repository is about how to build an SQLite version of the Arabic WordNet database. ☆10 · Mar 19, 2019 · Updated 6 years ago
- Amharic/Tigrinya/Oromo Dictionaries ☆38 · Jan 31, 2026 · Updated last month
- Gamma Agreement in Python ☆45 · Mar 4, 2024 · Updated last year
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research ☆34 · Dec 8, 2022 · Updated 3 years ago
- A tool for automatic phoneme transcription ☆159 · Apr 18, 2023 · Updated 2 years ago
- convert formatted text to markdown ☆13 · Dec 29, 2025 · Updated 2 months ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github… ☆14 · Dec 19, 2022 · Updated 3 years ago
- ☆11 · Jun 6, 2016 · Updated 9 years ago
- Swift implementation of multihash ☆15 · May 9, 2023 · Updated 2 years ago
- MG top-down beam parsing ☆13 · Jul 2, 2018 · Updated 7 years ago
- open source knowledge for Syllabics font design and development ☆10 · Nov 13, 2024 · Updated last year