PublicI / pdf-gcv-ocrLinks
Tool to OCR PDFs using Google Cloud Vision
☆42Updated 2 years ago
Alternatives and similar repositories for pdf-gcv-ocr
Users that are interested in pdf-gcv-ocr are comparing it to the libraries listed below
Sorting:
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.☆106Updated 4 years ago
- Ergonomic line-by-line transcription of scanned text.☆52Updated 4 years ago
- Conversions between various OCR formats☆78Updated 2 years ago
- A database of courts, tests and other experiments☆82Updated last week
- WARC and ARC indexing and discovery tools.☆124Updated 3 months ago
- A commandline tool and Python library for archiving data from Facebook using the Graph API.☆78Updated 7 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆189Updated last month
- Reading legal authority for the last time☆40Updated 3 months ago
- Process, enhance and evaluate multiple OCR output.☆22Updated 8 months ago
- guides and test data for OCR4all☆31Updated 2 years ago
- Named-Entity Recognition extension for OpenRefine☆28Updated 2 years ago
- A database of court reporters, tests and other experiments☆107Updated 2 weeks ago
- Review portion of FreeEed☆1Updated 6 months ago
- Structured data for classical studies☆19Updated 9 years ago
- an extensible tool to generate hyperlinks from legal citations☆34Updated 9 months ago
- A simple OpenRefine reconciliation service that runs on top of a CSV file☆120Updated 9 years ago
- The CIS OCR PostCorrectionTool☆42Updated 2 years ago
- Make a searchable pdf via Google Cloud Vision OCR☆14Updated 5 years ago
- Easily display Zotero items on a webpage☆32Updated 2 years ago
- A reconciliation service for OpenRefine serving data from a given CSV file.☆79Updated 4 months ago
- The OpenRefine Python Client from Paul Makepeace provides a library for communicating with an OpenRefine server. This fork extends the co…☆84Updated 3 years ago
- Master repository which includes most other OCR-D repositories as submodules☆73Updated this week
- A Rails engine supporting the discovery of web archives.☆50Updated 2 years ago
- Automatic alignment of books between HathiTrust, Internet Archive, Google Books, etc.☆35Updated 2 months ago
- A Twitter data collection and appraisal application.☆51Updated 2 years ago
- Efficient hOCR tooling☆45Updated last month
- A collection of interesting software, processes, and methodologies built and used across Public Media.☆17Updated 5 years ago
- A digital humanities operating system that runs on a USB disk.☆31Updated 7 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
- Metadata and per-statute PDFs for the U.S. Statutes at Large through volume 64 (1789-1951).☆17Updated 5 years ago