lehinevych / MediaWikiAPILinks
Python wrapper for the MediaWiki API to access and parse data from Wikipedia
☆42Updated last month
Alternatives and similar repositories for MediaWikiAPI
Users that are interested in MediaWikiAPI are comparing it to the libraries listed below
Sorting:
- A helper library full of URL-related heuristics.☆70Updated 2 weeks ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated last year
- Parse numbers written in natural language☆123Updated 11 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆18Updated 2 months ago
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/☆186Updated last month
- Binary Python bindings for poppler utils for content extraction☆42Updated 4 years ago
- Python client library to interface with the MediaWiki API☆336Updated last month
- Utilize your personal data like Google!☆160Updated 2 years ago
- International Address formatter which considers the standard formatting rules of the country☆26Updated 4 years ago
- Utility library to turn country names into ISO two-letter codes☆71Updated 2 months ago
- THIS REPOSITORY IS FORK☆30Updated 2 years ago
- Atom, RSS and JSON feed parser for Python 3☆117Updated 2 years ago
- Alternative robots parser module for Python☆20Updated last month
- A python package to simulate typographical errors.☆37Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- A Python implementation of Lunr.js 🌖☆200Updated 7 months ago
- A set of utilities for processing MediaWiki XML dump data.☆57Updated 7 months ago
- ☆63Updated 9 months ago
- Accurately find/replace/remove emojis in text strings☆162Updated last year
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 3 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆141Updated 2 months ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆17Updated 3 weeks ago
- Add website scraping abilities to Datasette☆64Updated 2 years ago
- Libzim binding for Python: read/write ZIM files in Python☆94Updated last month
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆148Updated 9 months ago
- Python wrapper for Ferret☆43Updated 3 years ago
- Python wrapper library for the Datamuse API☆80Updated 2 years ago
- Python port for IWNLP.Lemmatizer☆17Updated last year
- Extract text from HTML☆134Updated 5 years ago
- Generate reports for spaCy models.☆29Updated 3 years ago