clarin-eric / harvest-manager
A simple Java application for managing an OAI-PMH harvesting workflow
☆14 Updated last week
Alternatives and similar repositories for harvest-manager
Users that are interested in harvest-manager are comparing it to the libraries listed below
- Python client library which conforms to the SWORDv2 specification ☆21 Updated 2 years ago
- OxGarage is a web and RESTful service to manage the transformation of documents between a variety of formats. The majority of transfor… ☆53 Updated 9 years ago
- clarin-dspace digital repository based on DSpace and LINDAT/CLARIN DSpace ☆28 Updated this week
- Process, enhance and evaluate multiple OCR output. ☆22 Updated 6 months ago
- EFES (EpiDoc Front End Services) is a custom and readily customizable platform for publication and search/indexing of EpiDoc files, based… ☆31 Updated 3 months ago
- Metadata ingestion system for Digital Public Library of America ☆30 Updated last week
- Prototype SOLR-powered web archive exploration UI. ☆43 Updated 4 years ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents. ☆31 Updated 7 years ago
- A module for Omeka S that provides an API for the Neatline 3 single page application ☆14 Updated 2 years ago
- No longer maintained. Please use conciliator instead. ☆26 Updated 4 years ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr ☆22 Updated 6 years ago
- OpenAIRE Guidelines for Literature Repository Managers based on Dublin Core and DataCite Metadata Kernel ☆13 Updated last year
- DEPRECATED - no longer actively maintained. Automated workflow for harvesting, transforming and indexing of metadata using metha, OpenRef… ☆19 Updated 5 years ago
- Tools for TICCL ☆14 Updated 5 months ago
- Python script for breaking or atomizing OAI-PMH repositories into simpler text formats ☆26 Updated 2 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats) ☆25 Updated 2 years ago
- Simple command line oai-pmh harvester written in Python. ☆41 Updated 2 years ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms. ☆24 Updated 3 years ago
- Validator for the Image API ☆37 Updated 5 months ago
- Named Entity Recognition tool for Europeana Newspapers ☆14 Updated 7 years ago
- Heidelberg Monograph Publishing Tool (heiMPT) is a stand-alone platform, as well as a plug-in application for OMP. It enables a high degre… ☆22 Updated 3 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation ☆47 Updated 2 years ago
- Some ideas on making Bags into Git repositories ☆16 Updated 10 years ago
- Legacy Repository: TEI SimplePrint now merged into TEI Repository. Originally TEI Simple aimed to define a new highly-constrained and pr… ☆49 Updated 8 years ago
- ☆62 Updated 2 years ago
- Python package for harvesting records from OAI-PMH provider(s). ☆62 Updated 2 years ago
- Specification of a stand-off element for the TEI guidelines ☆12 Updated 4 years ago
- ALTO XML schema - latest and all former versions ☆52 Updated 10 months ago
- Command line interface to Wikidata Query Service ☆55 Updated last year
- A Java framework for filtering and modifying records from various sources ☆17 Updated 10 years ago