mediawiki-utilities / python-mwxmlLinks
A set of utilities for processing MediaWiki XML dump data.
☆57Updated 6 months ago
Alternatives and similar repositories for python-mwxml
Users that are interested in python-mwxml are comparing it to the libraries listed below
Sorting:
- search interface for scholarly works☆86Updated last year
- Citation Classification using hybrid neural network model for Wikipedia References☆30Updated 2 years ago
- A Python module to manipulate data on a Wikibase instance (like Wikidata) through the MediaWiki Wikibase API and the Wikibase SPARQL endp…☆78Updated this week
- An online citation generator for Wikipedia☆31Updated 2 weeks ago
- Python tools for interacting with Wikidata☆154Updated last year
- ☆39Updated 7 years ago
- A fun tool for quickly browsing unsourced snippets on Wikipedia.☆112Updated this week
- A Wikidata Python module integrating the MediaWiki API and the Wikidata SPARQL endpoint☆257Updated last year
- read and edit a Wikibase instance from the command line☆235Updated 3 months ago
- Softcite software mention recognizer, finding mentions and citations to software from within the academic literature☆78Updated 4 months ago
- Sort-friendly URI Reordering Transform (SURT) python module☆42Updated last year
- Perpetual Access To The Scholarly Record☆120Updated last year
- A Python library to parse MediaWiki WikiText☆312Updated 3 months ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- MOVED to https://gitlab.com/crossref/reference_matching_evaluation_framework☆17Updated 6 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated last year
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 5 years ago
- An algorithm to compute token-level provenance and changes for Wiki revisioned content. Tested at +95% accuracy for EN.Wikipedia.☆31Updated 6 years ago
- Parses Wikipedia citation templates in Python☆17Updated 5 months ago
- Linguistic search for large annotated text corpora, based on Apache Lucene☆115Updated this week
- Command line interface to Wikidata Query Service☆55Updated last year
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)☆36Updated last year
- The official repo for the QuickStatements PHP/HTML/JS interface☆46Updated last month
- Python client library to interface with the MediaWiki API☆333Updated last week
- Compute PageRank on >3 billion Wikipedia links on off-the-shelf hardware.☆60Updated 9 months ago
- Get the scholarly citation for any research product: software, preprint, paper, or dataset☆82Updated 2 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆72Updated last week
- Adding links to full text in Wikipedia references☆37Updated 2 months ago
- A deep learning model for extracting references from text☆29Updated last year
- Python 3 library for processing historical English☆67Updated last year