siznax / wptoolsLinks
Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis
☆591Updated 2 years ago
Alternatives and similar repositories for wptools
Users that are interested in wptools are comparing it to the libraries listed below
Sorting:
- Wikidata client library for Python☆359Updated last week
- Python wrapper for Wikipedia☆694Updated this week
- Python tools for interacting with Wikidata☆154Updated last year
- Filter and format a newline-delimited JSON stream of Wikibase entities☆101Updated 2 weeks ago
- Outputs a list of ranked DBpedia resources for a search string.☆187Updated 4 years ago
- Entity linking system for Wikidata updated by your edits in real time☆257Updated 9 months ago
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/☆186Updated last week
- A Python parser for MediaWiki wikicode☆827Updated 2 months ago
- Quickly extract multi-word phrases from a corpus☆194Updated 5 years ago
- A Wikidata Python module integrating the MediaWiki API and the Wikidata SPARQL endpoint☆257Updated last year
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆76Updated 3 months ago
- A simple interface to the Project Gutenberg corpus.☆329Updated 2 years ago
- A Python library to parse MediaWiki WikiText☆313Updated 4 months ago
- Guidelines.☆99Updated last year
- Fact Extraction from Wikipedia Text☆538Updated 9 years ago
- The software used to extract structured data from Wikipedia☆907Updated this week
- A wrapper for a remote SPARQL endpoint☆551Updated 4 months ago
- Docker containers for DBpedia Spotlight☆74Updated last year
- AmbiverseNLU: A Natural Language Understanding suite by Max Planck Institute for Informatics☆212Updated last year
- Streaming WARC/ARC library for fast web archive IO☆430Updated 9 months ago
- GERBIL - General Entity annotatoR Benchmark☆229Updated this week
- DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text. Improving Efficiency and Accuracy in Mult…☆182Updated 2 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆260Updated last month
- Measure the readability of a given text using surface characteristics☆80Updated 7 months ago
- creates a docker image with Virtuoso preloaded with the latest DBpedia dataset☆126Updated 10 months ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆182Updated 2 years ago
- spaCy module for linking text to Wikidata items☆241Updated 2 years ago
- Key information extraction from text and graph visualization☆91Updated 5 years ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆526Updated 10 months ago
- Cleans Reddit Text Data☆83Updated 5 years ago