sillsdev / silnlpLinks
A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages.
☆36Updated last week
Alternatives and similar repositories for silnlp
Users that are interested in silnlp are comparing it to the libraries listed below
Sorting:
- Curated corpus of parallel data derived from versions of the Bible provided by eBible.org.☆67Updated last month
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆47Updated 2 years ago
- Massively multilingual pronunciation mining☆344Updated last month
- 🙊 software for creating speech recognition models.☆159Updated last year
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆30Updated 2 weeks ago
- Universal Romanizer that can convert any unicode script to roman (latin) script☆214Updated 11 months ago
- The Unicode Cookbook for Linguists☆54Updated 4 years ago
- Aksharamukha Python Library☆50Updated 5 months ago
- SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/☆57Updated last year
- A multilingual parallel corpus created from translations of the Bible.☆182Updated last month
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆268Updated last month
- Audiobook alignment for Indigenous languages☆40Updated last week
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆13Updated 2 years ago
- PHOIBLE data and development.☆126Updated last year
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆87Updated last year
- Python API to access glottolog/glottolog☆30Updated last month
- Official releases of the PROIEL treebank of ancient Indo-European languages☆37Updated 2 years ago
- File format, model, API, and apps for manipulating text and its annotated features☆73Updated last week
- An NLP pipeline for Hebrew☆38Updated last month
- Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.☆283Updated 4 months ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆49Updated 2 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆51Updated last week
- ☆28Updated 9 months ago
- Ancient Greek language models for spaCy☆31Updated 4 months ago
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 4 months ago
- ☆74Updated 3 months ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Software for phonetic transcription of English and Finnish, and IPA tools☆15Updated 9 years ago
- Script for workflow to add morphological analysis into ELAN files☆13Updated 5 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆16Updated last week