gojiplus / image-to-textLinks
Images of Text to Text: Call Tesseract from Python and OCR a directory of pdfs
☆15Updated 5 years ago
Alternatives and similar repositories for image-to-text
Users that are interested in image-to-text are comparing it to the libraries listed below
Sorting:
- Tools for tracking stories on news homepages☆48Updated 5 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED]☆25Updated 9 years ago
- Monitor datasets, gets alerts when something happens☆210Updated 6 years ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 9 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated last year
- Ruby script to download bulk results from Archive.org's TV News database of closed captions☆14Updated 12 years ago
- Language checker and hyphenator extension for LibreOffice☆12Updated 5 years ago
- Utilities for retrieving whitehouse.gov transcripts and matching news quotes to them☆16Updated 10 years ago
- (Python) Execute tesseract OCR on a multi-page PDF.☆19Updated 2 years ago
- Parse OCR result files for pagenos, tables of contents, etc.☆14Updated 13 years ago
- Python scraper to get weekly CDC flu surveillance data☆25Updated 10 years ago
- The BITS Lab STACK tool for social media collection and analysis.☆39Updated 2 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆40Updated 8 years ago
- Python natural language processing work☆29Updated 16 years ago
- A contextual news development environment.☆49Updated 10 years ago
- Workshop bringing together individuals interested in developing curriculum, workflows, and tools to strengthen reproducibility in researc…