sillsdev / silnlp
A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages.
☆35Updated this week
Alternatives and similar repositories for silnlp:
Users that are interested in silnlp are comparing it to the libraries listed below
- Curated corpus of parallel data derived from versions of the Bible provided by eBible.org.☆57Updated 3 weeks ago
- Python API to access glottolog/glottolog☆29Updated 5 months ago
- The Unicode Cookbook for Linguists☆53Updated 4 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆42Updated last year
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques.☆23Updated last year
- Script for workflow to add morphological analysis into ELAN files☆13Updated 4 years ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning☆33Updated 2 weeks ago
- Unicode Standard tokenization routines and orthography profile segmentation☆35Updated last month
- 🙊 software for creating speech recognition models.☆158Updated 9 months ago
- Massively multilingual pronunciation mining☆333Updated 2 weeks ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆245Updated 7 months ago
- python package to read and write CLDF datasets☆15Updated last month
- CLDF: Cross-Linguistic Data Formats - the specification☆57Updated 11 months ago
- The Tesserae project aims to provide a flexible and robust web interface for exploring intertextual parallels. Select two poems below to …☆31Updated 5 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆156Updated this week
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆84Updated 10 months ago
- Audiobook alignment for Indigenous languages☆39Updated 3 weeks ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆25Updated this week
- The curation repository for the data behind Concepticon.☆38Updated last month
- In-browser OCR of Ancient Greek and Latin☆26Updated this week
- Yet another search platform for linguistic corpora.☆22Updated 2 weeks ago
- Linguistically analyzed Classical Tibetan texts☆26Updated 3 years ago
- Read, write, and manipulate Praat TextGrid files with Python☆127Updated last year
- SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/☆57Updated last year
- Jason Riggle's chart of phonological features in JSON format + extras☆53Updated 8 months ago
- ☆33Updated 9 months ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆15Updated last week
- File format, model, API, and apps for manipulating text and its annotated features☆71Updated last week
- ☆19Updated 3 years ago
- A guide to building language technology in new languages.☆58Updated 3 years ago