PublicI / pdf-gcv-ocr
Tool to OCR PDFs using Google Cloud Vision
☆40Updated 2 years ago
Alternatives and similar repositories for pdf-gcv-ocr:
Users that are interested in pdf-gcv-ocr are comparing it to the libraries listed below
- A database of courts, tests and other experiments☆68Updated 2 weeks ago
- Make a searchable pdf via Google Cloud Vision OCR☆14Updated 5 years ago
- Ergonomic line-by-line transcription of scanned text.☆51Updated 4 years ago
- A database of court reporters, tests and other experiments☆102Updated 2 weeks ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆187Updated last month
- The sequel to Big Cases Bot☆22Updated last week
- A collection of regular expressions to identify references to state laws.☆18Updated 9 years ago
- Extract case law citations with Node☆56Updated 10 years ago
- Convert a PDF via OCR to a TXT file in UTF-8 encoding☆147Updated last year
- A Twitter data collection and appraisal application.☆51Updated 2 years ago
- Abbreviations for use with the Abbreviation Filter developed for use with Multilingual Zotero.☆17Updated last year
- Python script for converting MBOX files to CSV.☆89Updated 3 years ago
- OCRmyPDF EasyOCR plugin☆67Updated 6 months ago
- A collection of regular expressions for matching citations to state, federal, and even international law☆33Updated 3 years ago
- Efficient hOCR tooling☆42Updated 3 weeks ago
- Web based JavaScript GUI library for proofreading/editing hOCR☆94Updated 6 years ago
- Building scantailor and its dependencies☆59Updated last year
- Structured data for classical studies☆18Updated 8 years ago
- Jurisdiction ID and abbreviation data files for using with Jurism and other projects.☆36Updated last year
- Comparing warc files☆16Updated 6 years ago
- Presentations on Quantified Self and Self-Tracking with Python☆29Updated 2 years ago
- Create and execute FFmpeg commands☆27Updated 3 years ago
- guides and test data for OCR4all☆30Updated 2 years ago
- Working with hOCR in Javascript☆126Updated 2 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆56Updated last year
- Create local backups of airtable databases☆36Updated last year
- Conversions between various OCR formats☆74Updated last year
- An open source set of decks for learning about digital preservation.☆23Updated 5 years ago
- Wordpress Plugin for Calibre☆12Updated 11 years ago
- A Free Database of Legal Materials☆25Updated 5 years ago