internetarchive / analyze_ocrLinks
Parse OCR result files for pagenos, tables of contents, etc.
☆14Updated 13 years ago
Alternatives and similar repositories for analyze_ocr
Users that are interested in analyze_ocr are comparing it to the libraries listed below
Sorting:
- Automatic alignment of books between HathiTrust, Internet Archive, Google Books, etc.☆35Updated 2 months ago
- A Rails engine supporting the discovery of web archives.☆50Updated 2 years ago
- Download digitized books from Internet Archive and view with IIIF, locally and offline.☆39Updated last year
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.☆86Updated last year
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 3 years ago
- WASAPI data transfer APIs☆45Updated 3 years ago
- A tool for the geospatial analysis, literary network visualization, and plot mapping of ancient texts☆14Updated 6 years ago
- GeoNames Reconciliation Service for OpenRefine/LODRefine/Google Refine☆48Updated 3 years ago
- utility to fetch provenance information from Internet Archive's Wayback Machine☆13Updated 3 years ago
- No longer maintained. Please use conciliator instead.☆26Updated 4 years ago
- Open ONI (Open Online Newspaper Initiative) Django web app☆50Updated 2 months ago
- A digital humanities operating system that runs on a USB disk.☆31Updated 7 years ago
- Command-line tile downloader/assembler for IIIF endpoints/manifests☆35Updated 3 years ago
- JSKOS data format for Knowledge Organization Systems☆42Updated last week
- A Hypothes.is integration plugin for OJS☆11Updated 3 months ago
- Prototype SOLR-powered web archive exploration UI.☆43Updated 5 years ago
- ☆63Updated 2 years ago
- Social Feed Manager user interface application.☆155Updated last year
- ☆14Updated 8 years ago
- A python client for the DPLA API☆43Updated 2 years ago
- This project is no longer supported. A pre-configured collection of tools including Social Feed Manager and Lentil for easily building Tw…☆15Updated 7 years ago
- Humanities Data Curation Record☆11Updated 7 years ago
- A Data Parsing/Data Manipulation Tool Supporting Digitization Projects and Other Data Analysis Projects☆46Updated 5 years ago
- This repo holds the source code for the web application☆15Updated last year
- MARC enhancements for Blacklight☆19Updated last week
- DEPRECATED. Replaced with Electron desktop application: https://github.com/bulk-reviewer/bulk-reviewer☆13Updated 6 years ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆54Updated this week
- work to make the ldr premis compliant☆8Updated 8 years ago
- Metadata ingestion system for Digital Public Library of America☆30Updated last month
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 7 years ago