mediawiki-utilities / python-mwxml
A set of utilities for processing MediaWiki XML dump data.
☆56 · Updated 5 months ago
Alternatives and similar repositories for python-mwxml
Users who are interested in python-mwxml are comparing it to the libraries listed below.
- Sort-friendly URI Reordering Transform (SURT) python module ☆42 · Updated 11 months ago
- ☆39 · Updated 7 years ago
- Citation Classification using hybrid neural network model for Wikipedia References ☆29 · Updated 2 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF ☆39 · Updated 3 years ago
- search interface for scholarly works ☆85 · Updated 11 months ago
- Adding links to full text in Wikipedia references ☆37 · Updated last month
- Perpetual Access To The Scholarly Record ☆120 · Updated 11 months ago
- Extract networks of entities from journalistic reporting ☆48 · Updated last year
- The official repo for the QuickStatements PHP/HTML/JS interface ☆46 · Updated 3 months ago
- A fun tool for quickly browsing unsourced snippets on Wikipedia. ☆111 · Updated last week
- A Python module to manipulate data on a Wikibase instance (like Wikidata) through the MediaWiki Wikibase API and the Wikibase SPARQL endp… ☆77 · Updated last week
- Python package for harvesting records from OAI-PMH provider(s). ☆64 · Updated 2 years ago
- Parses Wikipedia citation templates in Python ☆17 · Updated 3 months ago
- Python tools for interacting with Wikidata ☆154 · Updated last year
- Tool for generating filtered Wikidata RDF exports ☆42 · Updated 3 years ago
- An online citation generator for Wikipedia ☆31 · Updated 3 weeks ago
- A deep learning architecture for reference mining from literature in the arts and humanities. ☆16 · Updated 5 years ago
- [OBSOLETE] Replaced by https://gitlab.wikimedia.org/toolforge-repos/python-toolforge ☆22 · Updated 2 years ago
- Softcite software mention recognizer, finding mentions and citations to software from within the academic literature ☆77 · Updated 3 months ago
- Command line interface to Wikidata Query Service ☆55 · Updated last year
- Code for my Wikimedia Labs Tools account ☆93 · Updated 3 weeks ago
- Python client library to interface with the MediaWiki API ☆330 · Updated 2 weeks ago
- Python bot framework for Lexemes on Wikidata ☆18 · Updated 4 years ago
- Imports Wiktionary's grammatical data into Wikidata ☆18 · Updated 5 years ago
- Tools to process OpenAlex raw snapshot files ☆12 · Updated 5 months ago
- Link Wikidata items to large catalogs ☆96 · Updated 4 months ago
- An algorithm to compute token-level provenance and changes for Wiki revisioned content. Tested at +95% accuracy for EN.Wikipedia. ☆31 · Updated 6 years ago
- import information (affiliation, education) from ORCID database to Wikidata regarding authors of scientific papers ☆15 · Updated 2 years ago
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing) ☆36 · Updated last year
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service ☆180 · Updated 9 months ago