earwig / mwparserfromhell
A Python parser for MediaWiki wikicode
☆758 · Updated 3 months ago
Related projects
Alternatives and complementary repositories for mwparserfromhell
- A Python library to parse MediaWiki WikiText ☆289 · Updated 3 weeks ago
- Python client library to interface with the MediaWiki API ☆318 · Updated 2 weeks ago
- A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See … ☆632 · Updated this week
- Wikidata client library for Python ☆342 · Updated 4 months ago
- A Python library for working with and comparing language codes. ☆339 · Updated 7 months ago
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/ ☆181 · Updated last week
- A modern, interlingual wordnet interface for Python ☆217 · Updated last week
- Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis ☆574 · Updated last year
- read and edit a Wikibase instance from the command line ☆227 · Updated 3 weeks ago
- A python module for English lemmatization and inflection. ☆260 · Updated last year
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆365 · Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆1,263 · Updated 3 years ago
- A tool for learning vector representations of words and entities from Wikipedia ☆940 · Updated 6 months ago
- ☆165 · Updated 4 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆149 · Updated last year
- Port of Google's language-detection library to Python. ☆1,725 · Updated 9 months ago
- Heuristic based boilerplate removal tool ☆727 · Updated 6 months ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance. ☆243 · Updated 6 months ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine ☆185 · Updated 3 years ago
- Python stemming library using snowball stemmers ☆245 · Updated last month
- a collection of functions that measure the readability of a given body of text ☆191 · Updated 7 years ago
- Fast multi-keyword search engine for text strings ☆247 · Updated last month
- Textpipe: clean and extract metadata from text ☆300 · Updated 3 years ago
- Python wrapper for Wikipedia ☆600 · Updated this week
- A Wikidata Python module integrating the MediaWiki API and the Wikidata SPARQL endpoint ☆246 · Updated 11 months ago
- ☆795 · Updated last year
- mediawiki parser library ☆103 · Updated this week
- Universal Dependencies online documentation ☆272 · Updated this week
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ☆170 · Updated 2 years ago
- A tokenizer and sentence splitter for German and English web and social media texts. ☆135 · Updated 3 months ago