siznax / wptools
Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis
☆581 Updated last year
Alternatives and similar repositories for wptools:
Users who are interested in wptools are comparing it to the libraries listed below.
- Wikidata client library for Python ☆353 Updated 9 months ago
- A Python parser for MediaWiki wikicode ☆786 Updated this week
- Python wrapper for Wikipedia ☆651 Updated this week
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/ ☆181 Updated 2 months ago
- read and edit a Wikibase instance from the command line ☆230 Updated last month
- A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See … ☆666 Updated last week
- Fact Extraction from Wikipedia Text ☆534 Updated 8 years ago
- Entity linking system for Wikidata updated by your edits in real time ☆254 Updated 4 months ago
- A simple interface to the Project Gutenberg corpus. ☆325 Updated 2 years ago
- A Pythonic wrapper for the Wikipedia API ☆2,943 Updated 10 months ago
- A Python library to parse MediaWiki WikiText ☆305 Updated 5 months ago
- Python client library to interface with the MediaWiki API ☆326 Updated 2 weeks ago
- Streaming WARC/ARC library for fast web archive IO ☆408 Updated 4 months ago
- Python tools for interacting with Wikidata ☆153 Updated last year
- Python interface to the Stanford Named Entity Recognizer ☆292 Updated 3 years ago
- Measure the readability of a given text using surface characteristics ☆78 Updated 2 months ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes ☆181 Updated last year
- Python library for interactive topic model visualization. Port of the R LDAvis package. ☆1,825 Updated 9 months ago
- 🦆 Contextually-keyed word vectors ☆1,645 Updated last year
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface ☆254 Updated 7 months ago
- Full text geoparsing as a Python library ☆747 Updated 3 years ago
- Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API. ☆65 Updated 3 years ago
- Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc. ☆630 Updated 3 years ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies. ☆746 Updated 2 years ago
- spaCy module for linking text to Wikidata items ☆232 Updated 2 years ago
- Textpipe: clean and extract metadata from text ☆302 Updated 3 years ago
- Heuristic based boilerplate removal tool ☆765 Updated last month
- A machine learning tool for fishing entities