lehinevych / MediaWikiAPILinks
Python wrapper for the MediaWiki API to access and parse data from Wikipedia
☆42Updated 2 weeks ago
Alternatives and similar repositories for MediaWikiAPI
Users that are interested in MediaWikiAPI are comparing it to the libraries listed below
Sorting:
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/☆186Updated 3 weeks ago
- A helper library full of URL-related heuristics.☆73Updated 2 months ago
- Extract text from HTML☆135Updated 5 years ago
- Parse numbers written in natural language☆124Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- Alternative robots parser module for Python☆20Updated last week
- Parse government documents into well formed JSON☆75Updated this week
- Utilize your personal data like Google!☆161Updated 2 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 3 years ago
- python functions for applied use of schema.org☆36Updated 4 years ago
- an experimental implementation of Burrow's delta in Python 3☆21Updated 4 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆155Updated 2 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆142Updated last month
- Finds linguistic patterns effortlessly☆39Updated 2 years ago
- Language detection using Spacy and Fasttext☆57Updated 2 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆157Updated 3 months ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated 2 years ago
- python library to simplify working with jsonlines and ndjson data☆305Updated last year
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆130Updated 2 months ago
- MkDocs plugin to generate semantic reference Markdown pages from a knowledge graph☆40Updated last year
- A Python implementation of Lunr.js 🌖☆202Updated 9 months ago
- 📂 Additional lookup tables and data resources for spaCy☆113Updated 6 months ago
- Atom, RSS and JSON feed parser for Python 3☆117Updated 3 years ago
- A pure-Python robots.txt parser with support for modern conventions.☆74Updated last week
- Utility library to turn country names into ISO two-letter codes☆71Updated 4 months ago
- Python wrapper for Ferret☆45Updated 3 years ago
- Extract dates from text☆66Updated 4 years ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆191Updated 3 years ago
- URL normalization for Python☆99Updated 7 months ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆71Updated 3 years ago