sillsdev / silnlp
A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages.
☆36 Updated this week
Alternatives and similar repositories for silnlp
Users who are interested in silnlp compare it to the libraries listed below.
- Curated corpus of parallel data derived from versions of the Bible provided by eBible.org. ☆68 Updated 2 months ago
- 🙊 software for creating speech recognition models. ☆159 Updated last year
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆47 Updated 2 years ago
- Audiobook alignment for Indigenous languages ☆40 Updated 2 weeks ago
- Massively multilingual pronunciation mining ☆346 Updated last month
- SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/ ☆57 Updated last year
- The Unicode Cookbook for Linguists ☆56 Updated 4 years ago
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings ☆88 Updated last year
- Jason Riggle's chart of phonological features in JSON format + extras ☆54 Updated last year
- Universal Romanizer that can convert any Unicode script to Roman (Latin) script ☆214 Updated last year
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features. ☆271 Updated last month
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies) ☆30 Updated last month
- A multilingual parallel corpus created from translations of the Bible. ☆182 Updated 2 months ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training. ☆14 Updated 2 years ago
- An NLP pipeline for Hebrew ☆38 Updated last month
- File format, model, API, and apps for manipulating text and its annotated features ☆73 Updated 3 weeks ago
- Pronunciation dictionaries for multiple languages ☆90 Updated 7 years ago
- Unicode Standard tokenization routines and orthography profile segmentation ☆37 Updated 5 months ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine ☆72 Updated 2 weeks ago
- Python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones. ☆36 Updated last year
- A guide to building language technology in new languages. ☆58 Updated 3 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation ☆26 Updated 2 years ago
- Python API to access glottolog/glottolog ☆30 Updated last month
- Aksharamukha Python Library ☆51 Updated 6 months ago
- Linguistic processing for Common Voice ☆57 Updated last year
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning ☆33 Updated this week
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆171 Updated last month
- Python Finite-State Toolkit ☆57 Updated last week
- Official releases of the PROIEL treebank of ancient Indo-European languages ☆37 Updated 2 years ago
- The CMU Pronouncing Dictionary converted to IPA ☆86 Updated 6 years ago