lenakmeth / Wikinflection-Corpus
The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheniti and Neumann, 2020)
☆12Updated last year
Alternatives and similar repositories for Wikinflection-Corpus:
Users that are interested in Wikinflection-Corpus are comparing it to the libraries listed below
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆27Updated 5 years ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus☆18Updated 9 months ago
- A tool for automatic spelling normalization☆20Updated 4 years ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1)☆24Updated 3 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆41Updated last year
- Preliminary spaCy models for Latin☆14Updated 2 years ago
- Compiled tools, datasets, and other resources for historical text normalization.☆18Updated 5 years ago
- The Open Multilingual Wordnet☆61Updated 10 months ago
- Python wrapper for the CWB to extract concordances and score frequency lists☆21Updated this week
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 7 months ago
- The curation repository for the data behind Concepticon.☆38Updated 2 weeks ago
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Updated 3 months ago
- Bias correction for richness in abundance data☆11Updated last month
- German Morphological Analyzer☆47Updated 3 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- Convert CoNLL output of a dependency parser into a latex or graphviz tree☆12Updated 4 years ago
- Code for the paper: Wikinflection: Massive semi-supervised generation of multilingual inflectional corpus from Wiktionary (Metheniti and …☆9Updated 4 years ago
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆64Updated this week
- Python Multilingual Ucrel Semantic Analysis System☆31Updated 6 months ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated last year
- A simple toolkit for conducting analyses using corpus methods☆25Updated 3 years ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆126Updated 3 years ago
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated 11 months ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 3 years ago