PublicI / pdf-gcv-ocrLinks
Tool to OCR PDFs using Google Cloud Vision
☆42Updated 3 years ago
Alternatives and similar repositories for pdf-gcv-ocr
Users that are interested in pdf-gcv-ocr are comparing it to the libraries listed below
Sorting:
- A database of court reporters, tests and other experiments☆120Updated this week
- A database of courts, tests and other experiments☆98Updated 3 weeks ago
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.☆106Updated 5 years ago
- Python script for converting MBOX files to CSV.☆91Updated 4 years ago
- Create local backups of airtable databases☆36Updated 2 years ago
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!☆303Updated 8 months ago
- Ergonomic line-by-line transcription of scanned text.☆54Updated 5 years ago
- Find legal citations in any block of text☆206Updated 3 months ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆198Updated 8 months ago
- Abbreviations for use with the Abbreviation Filter developed for use with Multilingual Zotero.☆18Updated 2 years ago
- Jurisdiction ID and abbreviation data files for using with Jurism and other projects.☆39Updated 2 years ago
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆67Updated 2 years ago
- an extensible tool to generate hyperlinks from legal citations☆41Updated last week
- Tools to process books in a cloud based pipeline system☆65Updated last month
- Extract case law citations with Node☆59Updated 11 years ago
- A collection of regular expressions for matching citations to state, federal, and even international law☆40Updated 4 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆99Updated 3 years ago
- Named-Entity Recognition extension for OpenRefine☆30Updated 3 years ago
- Comparing warc files☆17Updated 6 years ago
- The sequel to Big Cases Bot☆28Updated 3 weeks ago
- A simple audio file transcriber that uses the Google Cloud Speech API for transcription.☆26Updated 7 years ago
- Make a searchable pdf via Google Cloud Vision OCR☆14Updated 6 years ago
- Reading legal authority for the last time☆42Updated 10 months ago
- Social Feed Manager user interface application.☆156Updated last year
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in t…☆132Updated 2 months ago
- A collection of tools for archiving and analysing the internet.☆77Updated 3 years ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆134Updated 3 weeks ago
- Easily display Zotero items on a webpage☆32Updated 2 years ago
- An API to scrape American court websites for metadata.☆532Updated this week
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆55Updated 2 months ago