internetarchive / analyze_ocr
Parse OCR result files for pagenos, tables of contents, etc.
☆14Updated 13 years ago
Alternatives and similar repositories for analyze_ocr
Users that are interested in analyze_ocr are comparing it to the libraries listed below
Sorting:
- A Rails engine supporting the discovery of web archives.☆50Updated last year
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 3 years ago
- A tool for the geospatial analysis, literary network visualization, and plot mapping of ancient texts☆14Updated 6 years ago
- Automatic alignment of books between HathiTrust, Internet Archive, Google Books, etc.☆35Updated 3 weeks ago
- A Hypothes.is integration plugin for OJS☆11Updated last month
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba…☆13Updated 2 years ago
- WASAPI data transfer APIs☆44Updated 3 years ago
- utility to fetch provenance information from Internet Archive's Wayback Machine☆13Updated 2 years ago
- No longer maintained. Please use conciliator instead.☆26Updated 4 years ago
- A python client for the DPLA API☆43Updated 2 years ago
- WARC and ARC indexing and discovery tools.☆123Updated 2 months ago
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io☆38Updated 9 years ago
- Selected code and data for The Online Books Page and related applications☆11Updated 2 weeks ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.☆86Updated last year
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des…☆157Updated 2 months ago
- This repo holds the source code for the web application☆15Updated last year
- Django app for managing PREMIS Events☆14Updated 3 months ago
- DEPRECATED. Replaced with Electron desktop application: https://github.com/bulk-reviewer/bulk-reviewer☆13Updated 6 years ago
- Prototype SOLR-powered web archive exploration UI.☆43Updated 4 years ago
- Open-source tools for working with BIBFRAME (see: http://bibframe.org), by default BIBFRAME Lite (see: http://bibfra.me) and more general…