nrc-cnrc / gramble
Domain-specific programming language for linguistic grammars and transducers — Langage dédié pour les grammaires linguistiques et les transducteurs.
☆13Updated last week
Alternatives and similar repositories for gramble:
Users that are interested in gramble are comparing it to the libraries listed below
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆15Updated this week
- Python Finite-State Toolkit☆50Updated last month
- universal syllabification algorithms☆43Updated 2 years ago
- Tools and scripts for working with ELAN☆10Updated 2 years ago
- Cross-Linguistic Transcription Systems☆14Updated 2 months ago
- python package to read and write CLDF datasets☆15Updated this week
- The Unicode Cookbook for Linguists☆53Updated 4 years ago
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆82Updated 9 months ago
- Recipes for cooking with CLDF data☆17Updated 2 months ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- The Language Independent Intelligent Dictionary☆23Updated 3 weeks ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆147Updated last month
- CLDF: Cross-Linguistic Data Formats - the specification☆57Updated 10 months ago
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated 11 months ago
- Helsinki Finite-State Technology (library and application suite)☆128Updated 3 weeks ago
- Audiobook alignment for Indigenous languages☆38Updated last week
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆24Updated last year
- A guide to building language technology in new languages.☆58Updated 3 years ago
- eXtensible Interlinear Glossed Text☆32Updated 2 years ago
- PHOIBLE data and development.☆121Updated 7 months ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆20Updated 2 months ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Updated 5 years ago
- English Resource Grammar☆20Updated 6 months ago
- universal tokenizer☆15Updated 3 years ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆26Updated 5 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆61Updated last month
- MAMMOTH: MAssively Multilingual Modular Open Translation @ Helsinki☆22Updated last week
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages.☆35Updated this week
- The Data Format for Digital Linguistics (DaFoDiL)☆22Updated 2 years ago