mediawiki-utilities / python-mwxml
A set of utilities for processing MediaWiki XML dump data.
☆45Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for python-mwxml
- [OBSOLETE] Replaced by https://gitlab.wikimedia.org/toolforge-repos/python-toolforge☆21Updated last year
- Sort-friendly URI Reordering Transform (SURT) python module☆40Updated 3 months ago
- A Python module to manipulate data on a Wikibase instance (like Wikidata) through the MediaWiki Wikibase API and the Wikibase SPARQL endp…☆67Updated this week
- Tool for generating filtered Wikidata RDF exports☆37Updated 2 years ago
- search interface for scholarly works☆80Updated 3 months ago
- A Wikidata Python module integrating the MediaWiki API and the Wikidata SPARQL endpoint☆247Updated last year
- Citation Classification using hybrid neural network model for Wikipedia References☆28Updated last year
- Python tools for interacting with Wikidata☆141Updated last year
- Wikimedia Pageview API client☆27Updated 6 years ago
- Adding links to full text in Wikipedia references☆37Updated 10 months ago
- The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.☆14Updated last year
- This repository has migrated to:☆100Updated 2 years ago
- Wikidata lexemes presentations☆24Updated last week
- read and edit a Wikibase instance from the command line☆227Updated this week
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)☆34Updated 5 months ago
- Imports Wiktionary's grammatical data into Wikidata☆17Updated 4 years ago
- The official repo for the QuickStatements PHP/HTML/JS interface☆41Updated 4 months ago
- Filter and format a newline-delimited JSON stream of Wikibase entities☆97Updated last month
- Compute PageRank on >3 billion Wikipedia links on off-the-shelf hardware.☆56Updated 3 weeks ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆67Updated 2 years ago
- Extract, transform, and analyze bibliographic data from Wikidata dumps☆24Updated last year
- Python bot framework for Lexemes on Wikidata☆18Updated 3 years ago
- A tool to analyse, browse and query Wikidata☆84Updated last month
- ☆23Updated 9 months ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆11Updated 11 months ago
- Utility to translate NIF files across identifier schemes, such as DBpedia and Wikidata☆12Updated 5 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆38Updated 2 years ago
- Code for my Wikimedia Labs Tools account☆91Updated 3 months ago
- Python package for harvesting records from OAI-PMH provider(s).☆62Updated 2 years ago
- Wikidata service to help create or link author items to published articles☆33Updated last month