lenakmeth / Wikinflection-CorpusLinks
The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheniti and Neumann, 2020)
☆12Updated last year
Alternatives and similar repositories for Wikinflection-Corpus
Users that are interested in Wikinflection-Corpus are comparing it to the libraries listed below
Sorting:
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 5 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆150Updated last year
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆33Updated 6 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆127Updated 4 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- UIMA CAS processing library written in Python☆90Updated last month
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Updated last year
- LingPy: Python library for quantitative tasks in historical linguistics☆138Updated last week
- Multi Tier Annotation Search☆26Updated 4 years ago
- A Python library for topic modeling and visualization☆67Updated 5 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆24Updated 2 years ago
- Python 3 library for processing historical English☆67Updated last year
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1)☆24Updated 3 years ago
- A Named-Entity Recogniser based on Grobid.☆54Updated 6 months ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 3 years ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆97Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆169Updated 3 years ago
- Detect and align similar passages☆112Updated 2 months ago
- Linguistic and stylistic complexity measures for (literary) texts☆84Updated last year
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learn…☆13Updated 3 years ago
- SEM, a free NLP tool relying on machine learning technologies, especially CRFs.☆23Updated 4 years ago
- Bias correction for richness in abundance data☆12Updated 3 months ago
- Literary Language Toolkit: code, models, corpora, and web tools☆11Updated last year
- Named Entity Recognition☆18Updated 8 months ago
- German lemmatization with IWNLP as extension for spaCy☆26Updated 2 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆95Updated 2 years ago
- Compiled tools, datasets, and other resources for historical text normalization.☆19Updated 6 years ago
- a python package for cleaning Gutenberg books and dataset☆34Updated 7 months ago
- Cython wrapper on Hunspell Dictionary☆66Updated last year