earwig / mwparserfromhellLinks
A Python parser for MediaWiki wikicode
☆838Updated 3 months ago
Alternatives and similar repositories for mwparserfromhell
Users that are interested in mwparserfromhell are comparing it to the libraries listed below
Sorting:
- A Python library to parse MediaWiki WikiText☆315Updated 5 months ago
- A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See …☆707Updated this week
- Python client library to interface with the MediaWiki API☆339Updated last week
- Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis☆586Updated 2 years ago
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/☆186Updated last month
- Streaming WARC/ARC library for fast web archive IO☆434Updated 10 months ago
- Wikidata client library for Python☆360Updated last month
- Python wrapper for Wikipedia☆702Updated this week
- A Python library for working with and comparing language codes.☆351Updated 5 months ago
- Heuristic based boilerplate removal tool☆800Updated 8 months ago
- A Python Wiktionary Parser☆367Updated 3 months ago
- Python stemming library using snowball stemmers☆264Updated 2 months ago
- A simple interface to the Project Gutenberg corpus.☆330Updated 2 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆108Updated last week
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆377Updated 2 years ago
- spellchecking library for python☆613Updated last month
- Access a database of word frequencies, in various natural languages.☆1,559Updated 9 months ago
- Port of Google's language-detection library to Python.☆1,851Updated 7 months ago
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to words☆1,043Updated 5 months ago
- A set of utilities for processing MediaWiki XML dump data.☆57Updated 8 months ago
- A python module for English lemmatization and inflection.☆272Updated 2 years ago
- A modern, interlingual wordnet interface for Python☆267Updated last month
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Updated 2 years ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies.☆748Updated 3 years ago
- All languages stopwords collection☆459Updated last year
- A tokenizer and sentence splitter for German and English web and social media texts.☆148Updated 10 months ago
- Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.☆632Updated 4 years ago
- Various utilities for processing the data.☆213Updated this week
- The software used to extract structured data from Wikipedia☆911Updated last week
- A tool for learning vector representations of words and entities from Wikipedia☆955Updated last year