bloomonkey / oai-harvest
Python package for harvesting records from OAI-PMH provider(s).
☆62Updated 2 years ago
Alternatives and similar repositories for oai-harvest:
Users that are interested in oai-harvest are comparing it to the libraries listed below
- Sickle: OAI-PMH for Humans☆109Updated last year
- The oaipmh module is a Python implementation of an "Open Archives$ Initiative Protocol for Metadata Harvesting"☆86Updated 2 years ago
- Simple command line oai-pmh harvester written in Python.☆41Updated 2 years ago
- python library for working with IIIF Image and Presentation APIs☆19Updated last month
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 3 years ago
- Named entity annotation tool☆27Updated last year
- DEPRECATED - no longer actively maintained. Automated workflow for harvesting, transforming and indexing of metadata using metha, OpenRef…☆19Updated 4 years ago
- an RDF datastore that gives researchers control over the sharing of data between datasets☆41Updated 10 months ago
- No longer maintained. Please use conciliator instead.☆26Updated 4 years ago
- Python tools for performing various operations on ALTO XML files☆45Updated 2 weeks ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆55Updated 7 months ago
- OpenRefine reconciler for Research Organization Registry☆13Updated this week
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Updated 4 months ago
- Python library to make creation of CIDOC CRM easier by mapping classes/predicates to python objects☆51Updated last year
- a CLI suggestion tool for Wikidata entities☆29Updated 8 years ago
- Documentation and Data related to the Linked Data and Wikidata Working Groups☆25Updated 10 months ago
- Lakesuperior, an alternative Fedora Repository implementation☆32Updated 2 years ago
- A project to coordinate implementing a system to signal whether references cited on Wikipedia are free to reuse☆19Updated 8 years ago
- IIIF Presentation API implementation in Python☆35Updated 10 months ago
- ANNotation Infrastructure using Finna: an automatic subject indexing tool using Finna as corpus☆15Updated 6 years ago
- IIIF Image API reference implementation and Python library☆55Updated 3 years ago
- This repository is community oriented wiki and issue tracker without any code. It is the community documentation and communication channe…☆22Updated 6 years ago
- This repository has migrated to:☆100Updated 2 years ago
- Project COUNTER/NISO SUSHI usage statistics☆54Updated 2 years ago
- Web service for creating and hosting IIIF manifests from METS/MODS documents☆36Updated 2 years ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35Updated last year
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 2 years ago
- Fast, permanent and flexible patterns for sharing and computing on texts with metadata using Apache Arrow.☆14Updated 3 years ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Updated 6 years ago