digitallinguistics / data-format
The Data Format for Digital Linguistics (DaFoDiL)
☆22Updated last year
Alternatives and similar repositories for data-format:
Users that are interested in data-format are comparing it to the libraries listed below
- The Metadata Editor for Transparent Archiving of language document materials☆20Updated 3 weeks ago
- CLDF: Cross-Linguistic Data Formats - the specification☆56Updated 9 months ago
- Yet another search platform for linguistic corpora.☆21Updated this week
- The Unicode Cookbook for Linguists☆53Updated 4 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆15Updated this week
- eXtensible Interlinear Glossed Text☆32Updated 2 years ago
- AUTOTYP data export☆41Updated last year
- Tools and scripts for working with ELAN☆10Updated 2 years ago
- Lexical data at Unicode☆67Updated 4 months ago
- python package to read and write CLDF datasets☆15Updated last week
- Recipes for cooking with CLDF data☆17Updated 2 months ago
- The curation repository for the data behind Concepticon.☆37Updated this week
- Umbrella repository that describes the collections contained in any given release of ELTeC☆13Updated 3 years ago
- Script for workflow to add morphological analysis into ELAN files☆13Updated 4 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- Python API to access glottolog/glottolog☆28Updated 3 months ago
- Code for the paper: Wikinflection: Massive semi-supervised generation of multilingual inflectional corpus from Wiktionary (Metheniti and …☆9Updated 4 years ago
- Collaborative data curation for Glottolog☆155Updated 2 weeks ago
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated 10 months ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆20Updated 2 months ago
- Automated listing of repos in GitHub with XML files containing teiHeader. Find a project using TEI today!☆17Updated this week
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆46Updated last year
- The Global WordNet Association Collaborative Inter-Lingual Index☆41Updated 2 months ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆13Updated 5 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆22Updated 2 years ago
- Dative: software for linguistic fieldwork☆14Updated last year
- A lexicon compiler for non-suffixational morphologies☆11Updated last month
- A web framework to display Cross Linguistic Linked Data.☆55Updated 3 months ago
- Data space of the DARIAH Lexical Resources Working Group☆20Updated 4 months ago
- The World Atlas of Language Structures☆58Updated 3 months ago