gojiplus / image-to-textLinks
Images of Text to Text: Call Tesseract from Python and OCR a directory of pdfs
☆15Updated 6 years ago
Alternatives and similar repositories for image-to-text
Users that are interested in image-to-text are comparing it to the libraries listed below
Sorting:
- Tools for tracking stories on news homepages☆48Updated 6 years ago
- Monitor datasets, gets alerts when something happens☆210Updated 6 years ago
- A tool for the geospatial analysis, literary network visualization, and plot mapping of ancient texts☆15Updated 7 years ago
- Image comparison QA tool for digital preservation workflows.☆14Updated 10 years ago
- Pure python script that takes user query and summarizes news related to it.☆25Updated 3 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED]☆25Updated 9 years ago
- Focused Crawler for VT's CTRNet☆10Updated 12 years ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.☆86Updated 2 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated last year
- Classifying the content of domains☆57Updated last month
- Python natural language processing work☆29Updated 16 years ago
- Labeled segmentation for the document structure of printed books☆15Updated 8 years ago
- Loose Miscellany☆21Updated 8 years ago
- The BITS Lab STACK tool for social media collection and analysis.☆39Updated 2 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆40Updated 8 years ago
- Service for creating Twitter datasets for research and archiving.☆26Updated 2 years ago
- Newsclipse: The IDE for news production.☆91Updated 10 years ago
- Parser for KAF NAF files written in Python☆16Updated 4 years ago
- A repository of materials for a proposed class on automated story bots.☆49Updated 7 years ago
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- Bluemix sample app written in Python that uses the Klout and Twitter API's to analyze the influence of individual twitter usernames☆18Updated 8 years ago
- A Data Parsing/Data Manipulation Tool Supporting Digitization Projects and Other Data Analysis Projects☆46Updated 6 years ago
- Scraper built with Scrapy.☆18Updated last year
- (Python) Execute tesseract OCR on a multi-page PDF.☆19Updated 2 years ago
- Amsterdam Content Analysis Toolkit☆46Updated 3 years ago
- A place to collect and share knowledge about liberating data from PDFs☆55Updated 3 years ago
- smappPy addresses common tasks programmers dealing with lots of data☆24Updated 9 years ago
- Website for America's Public Bible☆11Updated 5 years ago
- ScraperWiki Python library for scraping and saving data☆158Updated 2 years ago
- A platform for collecting, analyzing, and visualizing social media data.☆12Updated 4 years ago