mediawiki-utilities / python-mwxmlLinks
A set of utilities for processing MediaWiki XML dump data.
☆54Updated 4 months ago
Alternatives and similar repositories for python-mwxml
Users that are interested in python-mwxml are comparing it to the libraries listed below
Sorting:
- An algorithm to compute token-level provenance and changes for Wiki revisioned content. Tested at +95% accuracy for EN.Wikipedia.☆31Updated 6 years ago
- search interface for scholarly works☆85Updated 10 months ago
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)☆36Updated last year
- Sort-friendly URI Reordering Transform (SURT) python module☆42Updated 10 months ago
- Adding links to full text in Wikipedia references☆37Updated last week
- The World Atlas of Language Structures☆61Updated 8 months ago
- Perpetual Access To The Scholarly Record☆120Updated 10 months ago
- Imports Wiktionary's grammatical data into Wikidata☆18Updated 5 years ago
- Tool for generating filtered Wikidata RDF exports☆42Updated 3 years ago
- A deep learning model for extracting references from text☆29Updated last year
- Citation Classification using hybrid neural network model for Wikipedia References☆29Updated 2 years ago
- Simple Python Wrapper around MediaWiki API☆30Updated 2 years ago
- Tools for querying various name-based gender inference services and evaluate them.☆10Updated 2 years ago
- Python tools for interacting with Wikidata☆153Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- Wikipedia-based Taxonomies☆8Updated 9 months ago
- [OBSOLETE] Replaced by https://gitlab.wikimedia.org/toolforge-repos/python-toolforge☆22Updated 2 years ago
- A PDF classifier ensemble with REST API service☆23Updated 4 years ago
- A Python module to manipulate data on a Wikibase instance (like Wikidata) through the MediaWiki Wikibase API and the Wikibase SPARQL endp…☆78Updated this week
- Wikimedia Pageview API client☆27Updated 6 years ago
- The Global WordNet Association Collaborative Inter-Lingual Index☆43Updated 7 months ago
- Python bot framework for Lexemes on Wikidata☆18Updated 4 years ago
- The official repo for the QuickStatements PHP/HTML/JS interface☆46Updated 2 months ago
- Wikidata service to help create or link author items to published articles☆33Updated last month
- This packages up data for the Open Multilingual Wordnet☆49Updated 3 weeks ago
- Python 3 library for processing historical English☆67Updated 10 months ago
- MOVED to https://gitlab.com/crossref/reference_matching_evaluation_framework☆17Updated 5 years ago
- A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources☆16Updated 2 years ago
- Open Access PDF harvester☆40Updated last year
- Wikidata lexemes presentations☆23Updated 2 months ago