siznax / wptools
Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis
☆575Updated last year
Alternatives and similar repositories for wptools:
Users that are interested in wptools are comparing it to the libraries listed below
- Wikidata client library for Python☆345Updated 6 months ago
- Fact Extraction from Wikipedia Text☆530Updated 8 years ago
- Python wrapper for Wikipedia☆622Updated this week
- A Python parser for MediaWiki wikicode☆775Updated 2 weeks ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆253Updated 4 months ago
- LexRank algorithm for text summarization☆230Updated 9 months ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies.☆747Updated 2 years ago
- Python tools for interacting with Wikidata☆148Updated last year
- A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See …☆647Updated this week
- Heuristic based boilerplate removal tool☆744Updated 8 months ago
- read and edit a Wikibase instance from the command line☆229Updated last month
- A tool for learning vector representations of words and entities from Wikipedia☆945Updated 8 months ago
- A Python library to parse MediaWiki WikiText☆300Updated 3 months ago
- Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API.☆65Updated 2 years ago
- Filter and format a newline-delimited JSON stream of Wikibase entities☆98Updated 3 months ago
- Geotext extracts country and city mentions from text☆137Updated 2 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆368Updated 2 years ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆513Updated 3 months ago
- Python package for API access to news articles and events in the Event Registry☆234Updated last year
- NLP, before and after spaCy☆2,216Updated last year
- Python wrapper for Stanford CoreNLP☆353Updated 4 years ago
- Python interface to the Stanford Named Entity Recognizer☆291Updated 3 years ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆730Updated 5 months ago
- AmbiverseNLU: A Natural Language Understanding suite by Max Planck Institute for Informatics☆209Updated last year
- displaCy.js: An open-source NLP visualiser for the modern web☆344Updated 6 years ago
- Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram…☆253Updated 4 years ago
- Streaming WARC/ARC library for fast web archive IO☆395Updated last month
- A machine learning tool for fishing entities☆255Updated last week
- Entity linking system for Wikidata updated by your edits in real time☆251Updated last month
- The software used to extract structured data from Wikipedia☆871Updated 3 weeks ago