lehinevych / MediaWikiAPI
Python wrapper for the MediaWiki API to access and parse data from Wikipedia
☆39 Updated 4 months ago
Related projects
Alternatives and complementary repositories for MediaWikiAPI
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆42 Updated 5 years ago
- Alternative robots parser module for Python ☆16 Updated 2 weeks ago
- A helper library full of URL-related heuristics. ☆63 Updated last month
- Sort-friendly URI Reordering Transform (SURT) Python module ☆40 Updated 3 months ago
- Generate reports for spaCy models. ☆28 Updated 2 years ago
- Add website scraping abilities to Datasette ☆61 Updated last year
- Extract text from HTML ☆130 Updated 4 years ago
- Parse numbers written in natural language ☆109 Updated 2 weeks ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models ☆15 Updated last week
- Python 3 library for reading and writing WARC files ☆21 Updated 6 years ago
- Template repository for Python projects ☆32 Updated last month
- 🕊️ Radically lightweight command-line interfaces ☆102 Updated last year
- 🧬 A VS Code extension for annotating data with Prodigy ☆30 Updated 2 years ago
- A Python library for defining rule-based overrides on messy data ☆12 Updated 9 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆67 Updated last week
- Generate a SQLite database from Wikipedia & Wikidata dumps. ☆30 Updated 7 months ago
- Python-based Wikidata framework for easy dataframe extraction ☆39 Updated 11 months ago
- A Python package to simulate typographical errors. ☆31 Updated 10 months ago
- Language detection using spaCy and fastText ☆54 Updated 10 months ago
- Python wrapper for Ferret ☆42 Updated 2 years ago
- 📂 Additional lookup tables and data resources for spaCy ☆98 Updated last year
- 🌸 Train floret vectors ☆18 Updated last year
- MediaWiki API wrapper in Python http://pymediawiki.readthedocs.io/en/latest/ ☆181 Updated last week
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… ☆65 Updated 2 years ago
- (Deprecated - please use https://github.com/gmarmstrong/python-datamuse) Python wrapper for the Datamuse API ☆15 Updated 6 years ago
- Utility library to turn country names into ISO two-letter codes ☆66 Updated 3 weeks ago
- Commons of stupid, simple Python micro functions. Pull requests very welcome. ☆17 Updated 2 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line ☆121 Updated this week
- Finds linguistic patterns effortlessly ☆33 Updated last year