Python package for harvesting records from OAI-PMH provider(s).
☆64Aug 4, 2022Updated 3 years ago
Alternatives and similar repositories for oai-harvest
Users that are interested in oai-harvest are comparing it to the libraries listed below
- Sickle: OAI-PMH for Humans☆115Jul 29, 2023Updated 2 years ago
- The oaipmh module is a Python implementation of an "Open Archives Initiative Protocol for Metadata Harvesting"☆86Jan 23, 2023Updated 3 years ago
- Command line OAI-PMH harvester and client with built-in cache.☆130Mar 2, 2026Updated last week
- OAI-PMH harvester in shell.☆17Dec 23, 2025Updated 2 months ago
- WARNING: This repository is no longer maintained. This repository will not be updated. The repository will be kept available in read-only…☆17Oct 26, 2021Updated 4 years ago
- A CLI tool that generates IIIF Presentation 2.1 Manifests from METS/MODS☆24Apr 17, 2025Updated 10 months ago
- MOAI, an Open Access Server Platform for Institutional Repositories☆15Apr 21, 2023Updated 2 years ago
- A PHP API for Wikisource.☆11Sep 22, 2024Updated last year
- Ad-hoc light weight SPARQL endpoint from a file, using Python Flask and RDFlib☆15Oct 24, 2016Updated 9 years ago
- Pandoc filter to use Wikidata as reference manager☆19Nov 15, 2020Updated 5 years ago
- Experiment on metadata extraction using large language models such as GPT-3☆12Feb 1, 2023Updated 3 years ago
- Rank items in weighted lists☆18Jun 29, 2025Updated 8 months ago
- lod-explorativ is a prototype of a Svelte webapp that lets you explore bibliographic resources from a topic's point of view.☆15Jan 19, 2022Updated 4 years ago
- ☆17Jul 17, 2025Updated 7 months ago
- Web service for creating and hosting IIIF manifests from METS/MODS documents☆36Dec 8, 2022Updated 3 years ago
- Given the URL to a public JSON document in an International Image Interoperability Framework (IIIF) image server, this script will downlo…☆17Sep 6, 2022Updated 3 years ago
- Bookmarklet for showing RecordID as well as links for "Show PNX" and "Show Source Record"☆15Apr 28, 2021Updated 4 years ago
- Adding links to full text in Wikipedia references☆37Jun 16, 2025Updated 8 months ago
- Annotated corpus of data from War of The Rebellion (American Civil War archives)☆17Aug 8, 2016Updated 9 years ago
- Parses Wikipedia citation templates in Python☆17Mar 26, 2025Updated 11 months ago
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆28Jul 31, 2024Updated last year
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Dec 11, 2020Updated 5 years ago
- Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho…☆22Sep 2, 2022Updated 3 years ago
- Process, enhance and evaluate multiple OCR output.☆24Dec 2, 2025Updated 3 months ago
- Workshop materials for Code4Lib 2016 pre-conference: http://2016.code4lib.org/workshops/Measuring-Your-Metadata☆20Feb 26, 2019Updated 7 years ago
- A pandoc-based layout workflow for scholarly journals. It relies on markdown, YAML and pandoc to obtain multiple publication formats☆23Dec 2, 2025Updated 3 months ago
- Zotero extension for cataloguing☆49Feb 22, 2024Updated 2 years ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Feb 5, 2022Updated 4 years ago
- PDBF - A Toolkit for Creating Janiform Data Documents☆50Jul 31, 2016Updated 9 years ago
- process MODS records from Python☆20Jul 27, 2022Updated 3 years ago
- Research Object BagIt archive☆21Jan 13, 2023Updated 3 years ago
- Jupiter is a University of Alberta Libraries-based initiative to create a sustainable and extensible digital asset management system. Thi…☆28Dec 12, 2025Updated 2 months ago
- Interfacing the Unpaywall Database with Python☆32Feb 19, 2024Updated 2 years ago
- This repository has migrated to:☆100Oct 11, 2025Updated 4 months ago
- Community Documentation for the Carpentries☆70Nov 26, 2024Updated last year
- Update to the Public Broadcasting Metadata Dictionary project☆23May 1, 2018Updated 7 years ago
- No longer maintained. Please use conciliator instead.☆25Oct 12, 2020Updated 5 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆31Dec 8, 2022Updated 3 years ago
- Python Functions defined for computational ION☆11Jul 7, 2017Updated 8 years ago