siznax / wptools
Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis
☆574Updated last year
Related projects ⓘ
Alternatives and complementary repositories for wptools
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/☆181Updated 3 weeks ago
- Wikidata client library for Python☆342Updated 4 months ago
- read and edit a Wikibase instance from the command line☆227Updated this week
- A Python parser for MediaWiki wikicode☆758Updated 4 months ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆249Updated 2 months ago
- A simple interface to the Project Gutenberg corpus.☆321Updated last year
- A Wikidata Python module integrating the MediaWiki API and the Wikidata SPARQL endpoint☆247Updated last year
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,151Updated 5 months ago
- Filter and format a newline-delimited JSON stream of Wikibase entities☆97Updated last month
- Entity linking system for Wikidata updated by your edits in real time☆252Updated last year
- A Python library to parse MediaWiki WikiText☆290Updated last month
- Python wrapper for Wikipedia☆600Updated this week
- A wrapper for a remote SPARQL endpoint☆526Updated 3 months ago
- GERBIL - General Entity annotatoR Benchmark☆224Updated last week
- A Python function to break down hashtags or compound words created by putting together multiple words☆32Updated 9 years ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆512Updated 3 weeks ago
- Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API.☆65Updated 2 years ago
- A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See …☆633Updated this week
- PYthon Automated Term Extraction☆305Updated last year
- Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram…☆254Updated 4 years ago
- Textpipe: clean and extract metadata from text☆299Updated 3 years ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆725Updated 3 months ago
- Python tools for interacting with Wikidata☆141Updated last year
- Python implementation of TextRank algorithm for automatic keyword extraction and summarization using Levenshtein distance as relation bet…☆765Updated 2 years ago
- Visualize Wikidata items using d3.js☆193Updated 3 months ago
- Generating Vectors for DBpedia Entities via Word2Vec and Wikipedia Dumps. Questions? https://gitter.im/idio-opensource/Lobby☆601Updated 6 years ago
- DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text. Improving Efficiency and Accuracy in Mult…☆178Updated last year
- Python interface to the Stanford Named Entity Recognizer☆293Updated 3 years ago
- Implementation of the ClausIE information extraction system for python+spacy☆220Updated 2 years ago
- A tool for learning vector representations of words and entities from Wikipedia☆940Updated 6 months ago