lenakmeth / Wikinflection
Code for the paper: Wikinflection: Massive semi-supervised generation of multilingual inflectional corpus from Wiktionary (Metheniti and Neumann, 2018)
☆9Updated 4 years ago
Alternatives and similar repositories for Wikinflection:
Users who are interested in Wikinflection are comparing it to the libraries listed below:
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated last year
- LoanPy is a linguistic toolkit for rule-based prediction and evaluation of loanword adaptation and historical reconstructions and can be …☆15Updated 10 months ago
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1)☆23Updated 3 years ago
- A lexicon compiler for non-suffixational morphologies☆11Updated last month
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆24Updated last year
- ☆63Updated 8 months ago
- German Morphological Analyzer☆47Updated 3 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆63Updated last week
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆148Updated last year
- A fully-fledged PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- Python for Linguists – a Gentle Introduction to Programming☆44Updated 9 years ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆46Updated last year
- A simple configurable tool for manipulating dependency trees.☆13Updated last month
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- The curation repository for the data behind Concepticon.☆37Updated this week
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- Lexicons for the Multilingual UCREL Semantic Analysis System☆40Updated last year
- Official releases of the PROIEL treebank of ancient Indo-European languages☆37Updated last year
- The Global WordNet Association Collaborative Inter-Lingual Index☆41Updated 2 months ago
- The Data Format for Digital Linguistics (DaFoDiL)☆22Updated last year
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆17Updated this week
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆55Updated last month
- eXtensible Interlinear Glossed Text☆32Updated 2 years ago
- linguistics backend☆40Updated last year
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆50Updated last year
- Python framework for processing Universal Dependencies data☆55Updated last week
- This packages up data for the Open Multilingual Wordnet☆44Updated this week