gojiplus / image-to-text
Images of Text to Text: Call Tesseract from Python and OCR a directory of pdfs
☆16Updated 6 years ago
Alternatives and similar repositories for image-to-text
Users that are interested in image-to-text are comparing it to the libraries listed below
- Monitor datasets, get alerts when something happens☆210Updated 7 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED]☆25Updated 9 years ago
- Tools for tracking stories on news homepages☆48Updated 6 years ago
- Focused Crawler for VT's CTRNet☆10Updated 12 years ago
- Newsclipse: The IDE for news production.☆91Updated 11 years ago
- ☆36Updated 2 years ago
- JavaScript based graph visualization library with emphasis on customization and modularity.☆13Updated 6 years ago
- (Python) Execute tesseract OCR on a multi-page PDF.☆19Updated 2 years ago
- A tool for the geospatial analysis, literary network visualization, and plot mapping of ancient texts☆15Updated 7 years ago
- A personal document with reports, analysis, and plotting of personal analytics data using R.☆13Updated 9 years ago
- "Save as DAISY" add-in for Microsoft Word☆10Updated last month
- Share a unique URL to get feedback & tips anonymously☆20Updated 6 years ago
- Utilities for retrieving whitehouse.gov transcripts and matching news quotes to them☆16Updated 11 years ago
- A platform for collecting, analyzing, and visualizing social media data.☆12Updated 5 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas…☆12Updated 11 years ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.☆86Updated 2 years ago
- A little artoo.js bookmarklet to scrape and download the wanted or missing person lists from Interpol.☆12Updated 11 years ago
- This is a side project from 2008. This package contains a tool for automatically cropping and deskewing images of book pages captured by …☆28Updated 12 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated last year
- Quill Grammar App☆11Updated 7 years ago
- The news homepage archive☆80Updated 4 years ago
- Part of eMOP: Franken+ tool for creating font training for Tesseract OCR engine from page images.☆24Updated 10 years ago
- GenderTracker is a service that decomposes articles and computes various gender-related metrics based on the content.☆25Updated 12 years ago
- Accessibility tool for using the current window with an OCR technique☆19Updated last year
- Responsively embed DocumentCloud pages.☆22Updated 7 years ago
- a framework and language for exploring and analyzing feeds of social media data.☆23Updated 14 years ago
- Ruby script to download bulk results from Archive.org's TV News database of closed captions☆14Updated 12 years ago
- Labeled segmentation for the document structure of printed books☆16Updated 8 years ago
- Pure python script that takes user query and summarizes news related to it.☆25Updated 3 years ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 10 years ago