lenakmeth / Wikinflection-CorpusLinks
The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheniti and Neumann, 2020)
☆12Updated last year
Alternatives and similar repositories for Wikinflection-Corpus
Users that are interested in Wikinflection-Corpus are comparing it to the libraries listed below
Sorting:
- Bias correction for richness in abundance data☆12Updated 2 weeks ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1)☆25Updated 3 years ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆31Updated 5 years ago
- Python 3 library for processing historical English☆67Updated 11 months ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 5 years ago
- Named Entity Recognition☆19Updated 3 months ago
- Citation Classification using hybrid neural network model for Wikipedia References☆30Updated 2 years ago
- LingPy: Python library for quantitative tasks in historical linguistics☆136Updated 4 months ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- Preliminary spaCy models for Latin☆14Updated 2 years ago
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆22Updated last year
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆75Updated last month
- A cloud-based, open-source system for writing and publishing dictionaries.☆93Updated last year
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆127Updated 4 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆166Updated last month
- Python wrapper for the CWB to extract concordances and score frequency lists☆22Updated 3 weeks ago
- German Morphological Analyzer☆47Updated 3 years ago
- This packages up data for the Open Multilingual Wordnet☆50Updated last month
- Linguistic and stylistic complexity measures for (literary) texts☆82Updated last year
- A tokenizer and sentence splitter for German and English web and social media texts.☆146Updated 7 months ago
- Code for the paper: Wikinflection: Massive semi-supervised generation of multilingual inflectional corpus from Wiktionary (Metheniti and …☆9Updated 5 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- Named entity annotation tool☆28Updated 2 years ago
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- Cython wrapper on Hunspell Dictionary☆66Updated last year
- CLDF: Cross-Linguistic Data Formats - the specification☆58Updated last year
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆75Updated last month