PublicI / pdf-gcv-ocr
Tool to OCR PDFs using Google Cloud Vision
☆42 Updated 2 years ago
Alternatives and similar repositories for pdf-gcv-ocr:
Users who are interested in pdf-gcv-ocr are comparing it to the repositories listed below:
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf. ☆106 Updated 4 years ago
- A database of courts, tests and other experiments ☆73 Updated 2 weeks ago
- Make a searchable pdf via Google Cloud Vision OCR ☆14 Updated 5 years ago
- Ergonomic line-by-line transcription of scanned text. ☆51 Updated 4 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆188 Updated 2 months ago
- An online annotation platform for teaching and learning in the humanities. ☆107 Updated 2 months ago
- Named-Entity Recognition extension for OpenRefine ☆28 Updated 2 years ago
- A database of court reporters, tests and other experiments ☆105 Updated 2 weeks ago
- A collection of regular expressions to identify references to state laws. ☆18 Updated 9 years ago
- A collection of interesting software, processes, and methodologies built and used across Public Media. ☆17 Updated 5 years ago
- Reading legal authority for the last time ☆37 Updated last month
- searching large heterogenous data dumps with Universal Sentence Encoder ☆62 Updated 3 years ago
- A financial disclosure data extraction tool. ☆16 Updated last year
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL) ☆53 Updated last year
- Extract case law citations with Node ☆57 Updated 10 years ago
- Abbreviations for use with the Abbreviation Filter developed for use with Multilingual Zotero. ☆17 Updated last year
- A collection of regular expressions for matching citations to state, federal, and even international law ☆34 Updated 3 years ago
- Conversions between various OCR formats ☆75 Updated last year
- A tutorial on optical character recognition using tesseract, ImageMagick and other open source tools ☆69 Updated 2 months ago
- Process, enhance and evaluate multiple OCR output. ☆22 Updated 6 months ago
- ☆14 Updated last year
- Structured data for classical studies ☆19 Updated 8 years ago
- Jurisdiction ID and abbreviation data files for using with Jurism and other projects. ☆36 Updated last year
- an extensible tool to generate hyperlinks from legal citations ☆33 Updated 6 months ago
- Examples for getting started using https://case.law ☆65 Updated 2 years ago
- The OpenRefine Python Client from Paul Makepeace provides a library for communicating with an OpenRefine server. This fork extends the co… ☆83 Updated 3 years ago
- The Syriac New Testament in Text-Fabric ☆13 Updated last year
- Easily download U.S. census maps ☆33 Updated 2 years ago
- Add website scraping abilities to Datasette ☆62 Updated 2 years ago
- A simple catalog of Twitter ID Datasets ☆28 Updated 4 months ago