gojiplus / image-to-text
Images of Text to Text: Call Tesseract from Python and OCR a directory of PDFs
☆15 · Updated 6 years ago
Alternatives and similar repositories for image-to-text
Users who are interested in image-to-text are comparing it to the libraries listed below
- Ruby script to download bulk results from Archive.org's TV News database of closed captions ☆14 · Updated 12 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED] ☆25 · Updated 9 years ago
- Focused Crawler for VT's CTRNet ☆10 · Updated 12 years ago
- Monitor datasets, get alerts when something happens ☆210 · Updated 7 years ago
- Tools for tracking stories on news homepages ☆48 · Updated 6 years ago
- Investigative tool for extracting relevant areas from many documents ☆14 · Updated 10 years ago
- Quill Grammar App ☆11 · Updated 7 years ago
- Bluemix sample app written in Python that uses the Klout and Twitter APIs to analyze the influence of individual Twitter usernames ☆18 · Updated 8 years ago
- Closed Caption Transcripts of News Videos from archive.org 2014-2023 ☆50 · Updated 8 months ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas… ☆12 · Updated 11 years ago
- Code supporting the dissertation "Agents in Conflict," George Mason University, 2016 ☆20 · Updated 9 years ago
- Scraper built with Scrapy. ☆18 · Updated last year
- Discover, analyze and present data from the web and mobile in meaningful ways ☆82 · Updated 12 years ago
- (Python) Execute Tesseract OCR on a multi-page PDF. ☆19 · Updated 2 years ago
- "Save as DAISY" add-in for Microsoft Word ☆10 · Updated 3 weeks ago
- Code, data, and paper for Academia.edu citation advantage analysis ☆31 · Updated 9 years ago
- Classifying the content of domains ☆57 · Updated 3 months ago
- Command line tool for manipulating and analyzing text ☆29 · Updated 3 years ago
- Labeled segmentation for the document structure of printed books ☆16 · Updated 8 years ago
- Utilities for retrieving whitehouse.gov transcripts and matching news quotes to them ☆16 · Updated 11 years ago
- Parser for KAF NAF files written in Python ☆16 · Updated 4 years ago
- Pure Python script that takes a user query and summarizes news related to it. ☆25 · Updated 3 years ago
- Convert a corpus of PDFs to clean text files on a distributed architecture ☆38 · Updated last year
- Python scraper to get weekly CDC flu surveillance data ☆25 · Updated 11 years ago
- Topic Modeling Workflow in Python ☆16 · Updated 2 years ago
- Workshop bringing together individuals interested in developing curriculum, workflows, and tools to strengthen reproducibility in researc… ☆33 · Updated 10 years ago
- A repository of materials for a proposed class on automated story bots. ☆49 · Updated 7 years ago
- R client for the Virustotal Public API. Virustotal is a Google service that analyzes files and URLs for viruses etc. ☆12 · Updated 3 months ago
- The Data Journalism Handbook was born at a 48-hour workshop at MozFest 2011 in London. It subsequently spilled over into an international… ☆32 · Updated 13 years ago
- PageOneX. Analyzing front pages ☆52 · Updated last year