PublicI / pdf-gcv-ocrLinks
Tool to OCR PDFs using Google Cloud Vision
☆42Updated 2 years ago
Alternatives and similar repositories for pdf-gcv-ocr
Users that are interested in pdf-gcv-ocr are comparing it to the libraries listed below
Sorting:
- A database of court reporters, tests and other experiments☆116Updated last week
- Ergonomic line-by-line transcription of scanned text.☆54Updated 4 years ago
- Named-Entity Recognition extension for OpenRefine☆29Updated 2 years ago
- A database of courts, tests and other experiments☆94Updated last week
- A commandline tool and Python library for archiving data from Facebook using the Graph API.☆78Updated 7 years ago
- Jurisdiction ID and abbreviation data files for using with Jurism and other projects.☆38Updated last year
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!☆298Updated 4 months ago
- 📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs☆70Updated last year
- an extensible tool to generate hyperlinks from legal citations☆38Updated last year
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆98Updated 3 years ago
- Find legal citations in any block of text☆176Updated 2 weeks ago
- Extract case law citations with Node☆58Updated 11 years ago
- Reading legal authority for the last time☆40Updated 7 months ago
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.☆107Updated 4 years ago
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in t…☆129Updated 2 months ago
- A PDF classifier ensemble with REST API service☆23Updated 4 years ago
- The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.☆148Updated last year
- Core development repository. gitHub: Vsn 6 (2020 - ), Vsn 5 (2018 - 2020), Vsn 4 (2014-2017). Sourceforge: Vsn 3 (2009-2013), Vsn 1 & 2 (…☆64Updated this week
- CollectionBuilder-CSV is a "stand alone" template for creating digital collection and exhibit websites using Jekyll and a metadata CSV.☆34Updated 2 weeks ago
- A collection of regular expressions for matching citations to state, federal, and even international law☆40Updated 4 years ago
- Examples for getting started using https://case.law☆69Updated 3 years ago
- Social Feed Manager user interface application.☆156Updated last year
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆50Updated last week
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆195Updated 4 months ago
- pythonic interface to the courtlistener api☆20Updated 6 years ago
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework.☆131Updated 2 weeks ago
- Easily display Zotero items on a webpage☆32Updated 2 years ago
- The sequel to Big Cases Bot☆26Updated last week
- A Twitter data collection and appraisal application.☆51Updated 2 years ago
- A Rails engine supporting the discovery of web archives.☆50Updated 2 years ago