earwig / mwparserfromhell
A Python parser for MediaWiki wikicode
☆854 Updated 6 months ago
Alternatives and similar repositories for mwparserfromhell
Users who are interested in mwparserfromhell compare it to the libraries listed below.
- A Python library to parse MediaWiki WikiText ☆316 Updated 7 months ago
- A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See … ☆714 Updated this week
- Python client library to interface with the MediaWiki API ☆339 Updated 2 weeks ago
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/ ☆186 Updated last month
- Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis ☆589 Updated 2 years ago
- Python wrapper for Wikipedia ☆711 Updated 3 weeks ago
- Heuristic based boilerplate removal tool ☆810 Updated 10 months ago
- Wikidata client library for Python ☆363 Updated 2 months ago
- Streaming WARC/ARC library for fast web archive IO ☆442 Updated last year
- A Python Wiktionary Parser ☆369 Updated 5 months ago
- Python stemming library using snowball stemmers ☆275 Updated last month
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆108 Updated last month
- A modern, interlingual wordnet interface for Python ☆277 Updated this week
- A Python library for working with and comparing language codes. ☆353 Updated 8 months ago
- spellchecking library for python ☆617 Updated 3 months ago
- All languages stopwords collection ☆475 Updated 2 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆378 Updated 3 years ago
- ☆177 Updated 9 months ago
- Compact Language Detector 2 ☆887 Updated 4 years ago
- Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API. ☆66 Updated 3 years ago
- The Open English WordNet ☆689 Updated this week
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆155 Updated 2 years ago
- A python module for English lemmatization and inflection. ☆274 Updated 2 years ago
- Port of Google's language-detection library to Python. ☆1,867 Updated 10 months ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance. ☆250 Updated 3 months ago
- A simple interface to the Project Gutenberg corpus. ☆331 Updated 3 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆182 Updated 7 months ago
- A tokenizer and sentence splitter for German and English web and social media texts. ☆150 Updated last year
- Python library for reading and writing warc files ☆247 Updated 3 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆855 Updated last month