PublicI / pdf-gcv-ocr
Tool to OCR PDFs using Google Cloud Vision
☆42 · Updated 2 years ago
Alternatives and similar repositories for pdf-gcv-ocr
Users who are interested in pdf-gcv-ocr are comparing it to the repositories listed below.
- A database of court reporters, tests and other experiments ☆114 · Updated this week
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf. ☆106 · Updated 4 years ago
- A database of courts, tests and other experiments ☆90 · Updated this week
- Ergonomic line-by-line transcription of scanned text. ☆53 · Updated 4 years ago
- an extensible tool to generate hyperlinks from legal citations ☆35 · Updated 11 months ago
- Find legal citations in any block of text ☆169 · Updated 2 months ago
- Make a searchable pdf via Google Cloud Vision OCR ☆14 · Updated 5 years ago
- A collection of regular expressions for matching citations to state, federal, and even international law ☆39 · Updated 4 years ago
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip! ☆296 · Updated 3 months ago
- Named-Entity Recognition extension for OpenRefine ☆30 · Updated 2 years ago
- Jurisdiction ID and abbreviation data files for using with Jurism and other projects. ☆39 · Updated last year
- Abbreviations for use with the Abbreviation Filter developed for use with Multilingual Zotero. ☆18 · Updated last year
- Structured data for classical studies ☆19 · Updated 9 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆195 · Updated 3 months ago
- Legal citation extractor, via command line, JavaScript, or HTTP. See a live example at: ☆240 · Updated 5 years ago
- A commandline tool and Python library for archiving data from Facebook using the Graph API. ☆79 · Updated 7 years ago
- The sequel to Big Cases Bot ☆25 · Updated last month
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆395 · Updated last year
- CollectionBuilder-CSV is a "stand alone" template for creating digital collection and exhibit websites using Jekyll and a metadata CSV. ☆33 · Updated this week
- An API to scrape American court websites for metadata. ☆482 · Updated this week
- A tool to detect whether a PDF has a bad redaction ☆149 · Updated last month
- Create local backups of airtable databases ☆36 · Updated 2 years ago
- Fast PDF generation and compression. Deals with millions of pages daily. ☆122 · Updated this week
- A PDF classifier ensemble with REST API service ☆23 · Updated 4 years ago
- Python script for converting MBOX files to CSV. ☆90 · Updated 3 years ago
- Conversions between various OCR formats ☆80 · Updated 2 years ago
- A community-curated collection of judge profile pics that can be integrated anywhere ☆27 · Updated last week
- 📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs ☆70 · Updated last year
- Reading legal authority for the last time ☆40 · Updated 6 months ago
- The Syriac New Testament in Text-Fabric ☆13 · Updated last year