earwig / mwparserfromhell
A Python parser for MediaWiki wikicode
☆758Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for mwparserfromhell
- A Python library to parse MediaWiki WikiText☆290Updated last month
- Python client library to interface with the MediaWiki API☆319Updated 3 weeks ago
- A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See …☆633Updated this week
- Wikidata client library for Python☆342Updated 4 months ago
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/☆181Updated 3 weeks ago
- A tool for learning vector representations of words and entities from Wikipedia☆940Updated 6 months ago
- Streaming WARC/ARC library for fast web archive IO☆386Updated last week
- Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis☆574Updated last year
- A simple interface to the Project Gutenberg corpus.☆321Updated last year
- Python Implementations of Word Sense Disambiguation (WSD) Technologies.☆744Updated 2 years ago
- Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.☆626Updated 3 years ago
- ☆130Updated 3 years ago
- A Wikidata Python module integrating the MediaWiki API and the Wikidata SPARQL endpoint☆247Updated last year
- ☆165Updated 5 months ago
- read and edit a Wikibase instance from the command line☆227Updated this week
- Entity linking system for Wikidata updated by your edits in real time☆252Updated last year
- Python library for reading and writing warc files☆237Updated 2 years ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine☆185Updated 3 years ago
- A set of utilities for processing MediaWiki XML dump data.☆45Updated 3 months ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆725Updated 3 months ago
- spellchecking library for python☆601Updated 5 months ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆365Updated last year
- A modern, interlingual wordnet interface for Python☆221Updated this week
- A python implementation of the Rapid Automatic Keyword Extraction☆975Updated 4 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆230Updated 2 years ago
- A Python Wiktionary Parser☆360Updated 10 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆94Updated this week
- A wrapper for a remote SPARQL endpoint☆526Updated 3 months ago
- Python tools for interacting with Wikidata☆141Updated last year
- Python wrapper for Wikipedia☆600Updated this week