dohliam / hawaiian-corpus
Data from a corpus of written Hawaiian
☆13 Updated 8 years ago
Related projects:
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language ☆14 Updated last week
- The curation repository for the data behind Concepticon. ☆32 Updated this week
- Recipes for cooking with CLDF data ☆17 Updated 2 months ago
- Icelandic Treebank ☆23 Updated 3 months ago
- universal syllabification algorithms ☆43 Updated last year
- eXtensible Interlinear Glossed Text ☆31 Updated 2 years ago
- ☆54 Updated 3 months ago
- Wiktionary parser tool for many language editions. ☆53 Updated 2 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 Updated last year
- The Unicode Cookbook for Linguists ☆53 Updated 3 years ago
- The World Atlas of Language Structures ☆51 Updated 2 months ago
- CLDF: Cross-Linguistic Data Formats - the specification ☆53 Updated 5 months ago
- Public domain corpus of Catalan text ☆16 Updated 2 years ago
- Poetic processing, for Python. ☆36 Updated 4 months ago
- A JavaScript-based converter for transliterating Amharic text into Latin characters ☆20 Updated 2 years ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format ☆31 Updated 5 years ago
- python package to read and write CLDF datasets ☆15 Updated last week
- Python Finite-State Toolkit ☆39 Updated last month
- The Atlas of Pidgin and Creole Language Structures ☆9 Updated last year
- Gramadán: a computational grammar of Irish ☆14 Updated last year
- A web framework to display Cross Linguistic Linked Data. ☆54 Updated last week
- A lexicon compiler for non-suffixational morphologies ☆11 Updated 2 months ago
- Bunachar Náisiúnta Moirfeolaíochta | Irish National Morphology Database ☆22 Updated 3 months ago
- A lemmatizer for Icelandic text ☆16 Updated 6 years ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files ☆27 Updated 4 years ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates ☆42 Updated last year
- Global ASP - African Storybook Project for the World ☆14 Updated last year
- The Open Multilingual Wordnet ☆58 Updated 4 months ago
- Tools and scripts for working with ELAN ☆10 Updated 2 years ago
- AUTOTYP data export ☆38 Updated last year