napsternxg / WikiUtils
A set of utility scripts to process Wikipedia related data
☆38Updated 2 years ago
Alternatives and similar repositories for WikiUtils:
Users that are interested in WikiUtils are comparing it to the libraries listed below
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems☆59Updated last year
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- A spaCy wrapper for DBpedia Spotlight☆109Updated 2 years ago
- Template for AC297r projects☆33Updated 5 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 8 months ago
- Wikidata embedding☆50Updated 5 months ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆85Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 9 months ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆80Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated 2 years ago
- Annotation tool for coreference☆32Updated 2 years ago
- Extracting useful metadata from Wikipedia dumps in any language.☆26Updated 5 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆69Updated 3 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆255Updated 7 months ago
- Python tools for interacting with Wikidata☆153Updated last year
- UIMA CAS processing library written in Python☆88Updated last month
- Corpus of Open Access articles from multiple fields in Science, Technology, and Medicine.☆73Updated 8 years ago
- Get annotation suggestions for the INCEpTION text annotation platform from spaCy, Sentence BERT, scikit-learn and more. Runs as a web-ser…☆45Updated 6 months ago
- This is an implementation of Hearst patterns, for finding hyponyms, written in Python.☆87Updated 2 years ago
- spaCy-to-naf converter☆21Updated 10 months ago
- An open information extraction system that provides compact extractions☆91Updated 3 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- A thin wrapper around the DBpedia Spotlight HTTP API☆25Updated 7 years ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup☆70Updated 3 years ago
- Corpus of Attribution-Annotated news articles covering the campaigns during the year leading up to the 2016 US Presidential election.☆20Updated 6 years ago
- Training Temporal Word Embeddings with a Compass☆64Updated 2 years ago
- Language Model and Text Classification for German Language using Deep Learning☆18Updated 6 years ago
- ☆54Updated 9 years ago
- ☆46Updated 4 years ago