lehinevych / MediaWikiAPI
Python wrapper for the MediaWiki API to access and parse data from Wikipedia
☆39, updated last month
Alternatives and similar repositories for MediaWikiAPI
Users who are interested in MediaWikiAPI are comparing it to the libraries listed below.
- Alternative robots parser module for Python (☆17, updated 2 months ago)
- an experimental implementation of Burrow's delta in Python 3 (☆21, updated 3 years ago)
- Metadata extraction at a distance (☆24, updated 3 months ago)
- A library to extract a publication date from a web page, along with a measure of the accuracy. (☆41, updated 5 years ago)
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/ (☆183, updated 3 months ago)
- ISO 639 library for Python (☆33, updated 8 months ago)
- A python package to simulate typographical errors. (☆34, updated last year)
- Python based Wikidata framework for easy dataframe extraction (☆44, updated last year)
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch (☆70, updated 3 years ago)
- 🧬 A VS Code extension for annotating data with Prodigy (☆30, updated 3 years ago)
- Custom Python functions for working with SQLite FTS4 (☆22, updated 2 years ago)
- Compute PageRank on >3 billion Wikipedia links on off-the-shelf hardware. (☆58, updated 6 months ago)
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want) (☆66, updated 2 years ago)
- A compound word splitter for Python (☆48, updated 3 years ago)
- Sort-friendly URI Reordering Transform (SURT) python module (☆42, updated 9 months ago)
- Utility library to turn country names into ISO two-letter codes (☆66, updated 2 months ago)
- A helper library full of URL-related heuristics. (☆69, updated last month)
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… (☆68, updated 2 years ago)
- smart imports for Python (☆39, updated 3 years ago)
- International Address formatter which considers the standard formatting rules of the country (☆26, updated 3 years ago)
- 🌸 Train floret vectors (☆18, updated 2 years ago)
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF (☆39, updated 3 years ago)
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power (☆16, updated 2 years ago)
- Miscellaneous scripts to gather and process data of wikis. (☆22, updated 2 years ago)
- Generate reports for spaCy models. (☆29, updated 2 years ago)
- Python benchmark tool inspired by Geekbench. (☆17, updated 10 months ago)
- Finds linguistic patterns effortlessly (☆36, updated last year)
- LazyText is inspired by the idea of lazypredict, a library which helps build lot of basic models without much code. LazyText is for text … (☆18, updated 3 years ago)
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy (☆63, updated last year)
- Next-generation Punkt sentence boundary detection with zero dependencies (☆17, updated last month)