soodoku / image-to-text
Images of Text to Text: Call Tesseract from Python and OCR a directory of PDFs
☆15Updated 5 years ago
Alternatives and similar repositories for image-to-text:
Users who are interested in image-to-text are comparing it to the libraries listed below.
- (Python) Execute tesseract OCR on a multi-page PDF.☆18Updated last year
- Pure Python script that takes a user query and summarizes news related to it.☆25Updated 2 years ago
- A scraper focused on organizational GitHub accounts and their members.☆41Updated 2 years ago
- Query Wikipedia articles☆18Updated 2 years ago
- Monitor datasets, get alerts when something happens☆210Updated 6 years ago
- Global Data Journalists Directory☆10Updated 6 years ago
- South Africa's by-laws in XML format☆18Updated 6 years ago
- ☆36Updated last year
- Tools for working with Optical Character Recognition output☆16Updated 10 years ago
- RESTful API around the PETRARCH coding software☆10Updated 3 years ago
- Search the internet from your terminal. Speed read your results. Terminal nirvana.☆20Updated 4 years ago
- Stoplists for African languages generated from the ASP corpus☆14Updated 9 years ago
- Scraper built with Scrapy.☆14Updated 6 months ago
- An online reference for data journalism☆25Updated 10 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆16Updated 9 years ago
- Big Five personality traits: domains, aspects, facets☆25Updated last year
- An App Engine app that generates OPMLs from spreadsheets.☆12Updated 13 years ago
- Examples of bad data, especially from government.☆22Updated 6 months ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated 11 months ago
- A platform for tools that do stuff with data☆56Updated 6 years ago
- ☆26Updated 11 years ago
- An attempt at using as much COOL computer science stuff as possible to produce a single image (Lindenmayer system, Penrose tiling, Travel…☆17Updated 10 years ago
- Organizing and publishing the web domains of the US federal government☆16Updated 6 years ago
- Word lists for analyzing media reporting☆24Updated 6 years ago
- Bot software for creating Wikipedia articles using geographical data☆10Updated 7 years ago
- Open source universal intelligent IP video surveillance system.☆12Updated 6 years ago
- 🌱🍎🍆 A shell script to parse the data by the Food and Agriculture Organization of the United Nations on crops/fruits.☆15Updated 2 years ago
- Why are our best and most experienced employees leaving prematurely?☆12Updated 7 years ago
- Walmart Web Scraper written in Python 3 to extract coupon details for a store location☆14Updated 6 years ago
- This is a tool to generate an archive of a Facebook group's discussions.☆15Updated 6 years ago