lehinevych / MediaWikiAPI
Python wrapper for the MediaWiki API to access and parse data from Wikipedia
☆39Updated this week
Alternatives and similar repositories for MediaWikiAPI:
Users that are interested in MediaWikiAPI are comparing it to the libraries listed below
- an experimental implementation of Burrow's delta in Python 3☆21Updated 3 years ago
- Alternative robots parser module for Python☆17Updated last week
- Poetic processing, for Python.☆40Updated 10 months ago
- Stylometry library for Burrows' Delta method☆35Updated 10 months ago
- 📂 Additional lookup tables and data resources for spaCy☆105Updated last month
- Parse numbers written in natural language☆109Updated 4 months ago
- python functions for applied use of schema.org☆36Updated 3 years ago
- International Address formatter which considers the standard formatting rules of the country☆26Updated 3 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Faster, modernized fork of the language identification tool langid.py☆55Updated 3 months ago
- MkDocs plugin to generate semantic reference Markdown pages from a knowledge graph☆37Updated 8 months ago
- A helper library full of URL-related heuristics.☆66Updated 5 months ago
- Python API to query a SPARQL endpoint☆32Updated 3 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Language detection using Spacy and Fasttext☆55Updated last year
- Link Wikidata items to large catalogs☆96Updated last week
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated 11 months ago
- Generate reports for spaCy models.☆29Updated 2 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆124Updated 2 months ago
- Potnia is an open-source Python library designed to convert Romanized transliterations of ancient texts into Unicode representations of t…☆16Updated last week
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/☆181Updated last month
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated last year
- Python based Wikidata framework for easy dataframe extraction☆43Updated last year
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆70Updated 3 years ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 3 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆38Updated 3 years ago
- 🌸 Train floret vectors☆18Updated last year
- A fun tool for quickly browsing unsourced snippets on Wikipedia.☆109Updated last month
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆29Updated 3 years ago