lehinevych / MediaWikiAPI
Python wrapper for the MediaWiki API to access and parse data from Wikipedia
☆42 Updated last month
Alternatives and similar repositories for MediaWikiAPI
Users who are interested in MediaWikiAPI are comparing it to the libraries listed below.
- A helper library full of URL-related heuristics. ☆73 Updated 3 months ago
- Alternative robots parser module for Python ☆20 Updated last month
- Fast and robust date extraction from web pages, with Python or on the command-line ☆142 Updated 2 months ago
- Extract text from HTML ☆135 Updated 5 years ago
- MkDocs plugin to generate semantic reference Markdown pages from a knowledge graph ☆40 Updated last year
- Language detection using Spacy and Fasttext ☆57 Updated 2 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… ☆68 Updated 3 years ago
- An experimental implementation of Burrows' Delta in Python 3 ☆21 Updated 4 years ago
- A Python implementation of Lunr.js 🌖 ☆202 Updated 10 months ago
- 🕊️ Radically lightweight command-line interfaces ☆108 Updated 4 months ago
- Atom, RSS and JSON feed parser for Python 3 ☆117 Updated 3 years ago
- Parse numbers written in natural language ☆124 Updated last year
- Python functions for applied use of schema.org ☆37 Updated 4 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF ☆39 Updated 3 years ago
- Python based Wikidata framework for easy dataframe extraction ☆45 Updated 2 years ago
- 🧬 A VS Code extension for annotating data with Prodigy ☆30 Updated 4 years ago
- Scalable String Similarity Joins in Python ☆39 Updated last year
- Binary Python bindings for poppler utils for content extraction ☆42 Updated 4 years ago
- 📂 Additional lookup tables and data resources for spaCy ☆113 Updated 7 months ago
- Accurately find/replace/remove emojis in text strings ☆163 Updated 2 years ago
- Python API for PDF documents ☆124 Updated last year
- 🌸 Train floret vectors ☆18 Updated 2 years ago
- Abydos NLP/IR library for Python ☆193 Updated 3 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies ☆26 Updated last month
- ☆63 Updated 2 weeks ago
- Python library that reads JSON files of any size. ☆196 Updated 2 years ago
- Python library to simplify working with jsonlines and ndjson data ☆306 Updated last year
- Pythonic search engine based on PyLucene. ☆131 Updated 2 weeks ago
- Parse natural language time expressions in Python ☆131 Updated 3 years ago
- Python wrapper for Ferret ☆45 Updated 4 years ago