napsternxg / WikiUtils
A set of utility scripts to process Wikipedia related data
☆37Updated 2 years ago
Alternatives and similar repositories for WikiUtils:
Users that are interested in WikiUtils are comparing it to the libraries listed below
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems☆57Updated 9 months ago
- Extracting useful metadata from Wikipedia dumps in any language.☆26Updated 5 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆86Updated 2 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆79Updated last year
- Template for AC297r projects☆33Updated 5 years ago
- Python tools for interacting with Wikidata☆148Updated last year
- Extracting scientific claims from biomedical abstracts (powered by AllenNLP)☆140Updated 3 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- Training Temporal Word Embeddings with a Compass☆64Updated 2 years ago
- A spaCy wrapper for DBpedia Spotlight☆108Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆79Updated 6 months ago
- ☆54Updated 3 years ago
- ☆57Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- Data and code from our "Inferring Which Medical Treatments Work from Reports of Clinical Trials", NAACL 2019. This work concerns inferrin…☆61Updated 3 years ago
- ☆54Updated 9 years ago
- JSON-NLP Schema for transfer of NLP output using JSON☆51Updated 4 years ago
- Get annotation suggestions for the INCEpTION text annotation platform from spaCy, Sentence BERT, scikit-learn and more. Runs as a web-ser…☆44Updated 4 months ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆253Updated 4 months ago
- This is an implementation of Hearst patterns, for finding hyponyms, written in Python.☆88Updated 2 years ago
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated last year
- Mapping Wikipedia pages to Wikidata IDs and vice versa.☆156Updated last year
- ☆64Updated last year
- Python code for reading Brat Repositories. Supports saving and reading from XML files for easy acces to annotations.☆41Updated 5 years ago
- An open information extraction system that provides compact extractions☆90Updated 2 years ago
- This is a simple Python package for calculating a variety of lexical diversity indices☆69Updated last year
- Repository containing data and code of the ACL-19 paper "Relational Word Embeddings"☆13Updated 4 years ago
- MedType: Improving Medical Entity Linking with Semantic Type Prediction☆116Updated last year
- Knowledge Base Embeddings for DBpedia☆84Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago