sillsdev / silnlp
A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages.
☆35Updated this week
Alternatives and similar repositories for silnlp:
Users that are interested in silnlp are comparing it to the libraries listed below
- Curated corpus of parallel data derived from versions of the Bible provided by eBible.org.☆56Updated last week
- Python API to access glottolog/glottolog☆29Updated 3 months ago
- Audiobook alignment for Indigenous languages☆38Updated last week
- Official releases of the PROIEL treebank of ancient Indo-European languages☆37Updated last year
- ☆19Updated 3 years ago
- Script for workflow to add morphological analysis into ELAN files☆13Updated 4 years ago
- File format, model, API, and apps for manipulating text and its annotated features☆70Updated this week
- Syntax trees, morphology, and linguistic annotations for the Greek Bible☆24Updated 4 months ago
- Unicode Standard tokenization routines and orthography profile segmentation☆35Updated this week
- Python Finite-State Toolkit☆50Updated last month
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆82Updated 9 months ago
- The Unicode Cookbook for Linguists☆53Updated 4 years ago
- Tools and scripts for working with ELAN☆10Updated 2 years ago
- Massively multilingual pronunciation mining☆331Updated 3 months ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆40Updated last year
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning☆33Updated last week
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆24Updated last year
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆32Updated last year
- The curation repository for the data behind Concepticon.☆37Updated this week
- Perseus Treebank Data☆71Updated 8 months ago
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated 11 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆24Updated last week
- ☆19Updated last month
- LoanPy is a linguistic toolkit for rule-based prediction and evaluation of loanword adaptation and historical reconstructions and can be …☆15Updated 11 months ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆15Updated this week
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆147Updated last month
- CLDF: Cross-Linguistic Data Formats - the specification☆57Updated 10 months ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆47Updated last year
- 🙊 software for creating speech recognition models.☆158Updated 8 months ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆240Updated 6 months ago