mediawiki-utilities / python-mwxml
A set of utilities for processing MediaWiki XML dump data.
☆53 · Updated 3 months ago
Alternatives and similar repositories for python-mwxml
Users who are interested in python-mwxml are comparing it to the libraries listed below.
- Sort-friendly URI Reordering Transform (SURT) python module ☆42 · Updated 9 months ago
- Simple Python Wrapper around MediaWiki API ☆30 · Updated 2 years ago
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing) ☆36 · Updated 11 months ago
- search interface for scholarly works ☆85 · Updated 9 months ago
- Compute PageRank on >3 billion Wikipedia links on off-the-shelf hardware. ☆58 · Updated 6 months ago
- Wikidata lexemes presentations ☆23 · Updated last month
- Miscellaneous scripts to gather and process data of wikis. ☆22 · Updated 2 years ago
- Imports Wiktionary's grammatical data into Wikidata ☆17 · Updated 5 years ago
- An online citation generator for Wikipedia ☆31 · Updated last month
- Repo for the Wikimedia Listeria bot ☆26 · Updated 7 months ago
- [OBSOLETE] Replaced by https://gitlab.wikimedia.org/toolforge-repos/python-toolforge ☆22 · Updated 2 years ago
- Python tools for interacting with Wikidata ☆153 · Updated last year
- An algorithm to compute token-level provenance and changes for Wiki revisioned content. Tested at +95% accuracy for EN.Wikipedia. ☆31 · Updated 6 years ago
- Tool for generating filtered Wikidata RDF exports ☆42 · Updated 3 years ago
- An LL parser for extracting information from Wiki text, particularly Wiktionary. ☆49 · Updated last year
- Adding links to full text in Wikipedia references ☆37 · Updated last year
- Github mirror of "analytics/quarry/web" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_acce… ☆43 · Updated 2 years ago
- Citation Classification using hybrid neural network model for Wikipedia References ☆28 · Updated 2 years ago
- https://en.wikipedia.org/wiki/User:SuperHamster/CiteUnseen ☆16 · Updated last year
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing) ☆49 · Updated 11 months ago
- Softcite software mention recognizer, finding mentions and citations to software from within the academic literature ☆77 · Updated last month
- A deep learning architecture for reference mining from literature in the arts and humanities. ☆16 · Updated 5 years ago
- The World Atlas of Language Structures ☆60 · Updated 6 months ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary ☆13 · Updated 5 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF ☆39 · Updated 3 years ago
- The official repo for the QuickStatements PHP/HTML/JS interface ☆46 · Updated last month
- A Python module to manipulate data on a Wikibase instance (like Wikidata) through the MediaWiki Wikibase API and the Wikibase SPARQL endp… ☆78 · Updated this week
- Legal document classification with EuroVoc descriptors on 22 languages. ☆26 · Updated last year
- Tools for querying various name-based gender inference services and evaluating them. ☆10 · Updated 2 years ago
- Wikimedia Pageview API client ☆27 · Updated 6 years ago