5j9 / wikitextparserLinks
A Python library to parse MediaWiki WikiText
☆310Updated 2 months ago
Alternatives and similar repositories for wikitextparser
Users that are interested in wikitextparser are comparing it to the libraries listed below
Sorting:
- A Python parser for MediaWiki wikicode☆817Updated last month
- Wikidata client library for Python☆356Updated last year
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆105Updated last week
- Python client library to interface with the MediaWiki API☆331Updated last month
- A modern, interlingual wordnet interface for Python☆255Updated 3 weeks ago
- Streaming WARC/ARC library for fast web archive IO☆425Updated 7 months ago
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/☆185Updated 6 months ago
- A Python library for working with and comparing language codes.☆345Updated 3 months ago
- A python module for English lemmatization and inflection.☆268Updated last year
- Python tools for interacting with Wikidata☆154Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Updated 2 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆53Updated 4 years ago
- This packages up data for the Open Multilingual Wordnet☆50Updated 2 months ago
- A Python Wiktionary Parser☆362Updated 2 weeks ago
- WordNet in JSON format.☆91Updated 4 years ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆70Updated 3 years ago
- Cython wrapper on Hunspell Dictionary☆66Updated last year
- Compute PageRank on >3 billion Wikipedia links on off-the-shelf hardware.☆60Updated 9 months ago
- A set of utilities for processing MediaWiki XML dump data.☆57Updated 5 months ago
- Filter and format a newline-delimited JSON stream of Wikibase entities☆98Updated last month
- ☆171Updated 4 months ago
- Entity linking system for Wikidata updated by your edits in real time☆256Updated 8 months ago
- Stand-alone WordNet API☆49Updated 3 years ago
- A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See …☆691Updated this week
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆64Updated this week
- A tokenizer and sentence splitter for German and English web and social media texts.☆147Updated 7 months ago
- A compound word splitter for Python☆48Updated 3 years ago
- German part-of-speech dictionary☆45Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆168Updated 2 months ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆32Updated 5 years ago