sillsdev / silnlp
A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages.
☆35 Updated this week
Alternatives and similar repositories for silnlp:
Users who are interested in silnlp are comparing it to the libraries listed below.
- Curated corpus of parallel data derived from versions of the Bible provided by eBible.org. ☆54 Updated this week
- Audiobook alignment for Indigenous languages ☆38 Updated last month
- The Unicode Cookbook for Linguists ☆53 Updated 4 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆37 Updated last year
- Python Finite-State Toolkit ☆47 Updated last week
- These are lists for a variety of languages containing words that are distinctive to each language. ☆35 Updated 2 years ago
- ☆19 Updated 3 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language ☆15 Updated 2 weeks ago
- 🙊 software for creating speech recognition models. ☆154 Updated 7 months ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 Updated last year
- Script for workflow to add morphological analysis into ELAN files ☆13 Updated 4 years ago
- ☆30 Updated 7 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language ☆24 Updated this week
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques. ☆23 Updated last year
- Finite-state script normalization and processing utilities ☆38 Updated this week
- ☆24 Updated 3 months ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and their 102 different scripts (orthographies) ☆27 Updated 3 years ago
- Universal Romanizer that can convert any Unicode script to Roman (Latin) script ☆169 Updated 5 months ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning ☆32 Updated this week
- Unicode Standard tokenization routines and orthography profile segmentation ☆34 Updated 2 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation ☆24 Updated last year
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa. ☆39 Updated 2 years ago
- Morfessor EM+Prune ☆10 Updated 4 years ago
- Yet another search platform for linguistic corpora. ☆20 Updated this week
- Open information and community for machine translation ☆72 Updated last month
- NTREX -- News Test References for MT Evaluation ☆80 Updated 7 months ago
- Massively multilingual pronunciation mining ☆327 Updated last month
- Tools and scripts for working with ELAN ☆10 Updated 2 years ago
- CLDF: Cross-Linguistic Data Formats - the specification ☆56 Updated 9 months ago
- File format, model, API, and apps for manipulating text and its annotated features ☆69 Updated this week