andrewdefries / TesseractOCR
Full-text extraction using the open-source Tesseract OCR software (https://code.google.com/p/tesseract-ocr/) and ImageMagick
☆11 Updated 9 years ago
Related projects
Alternatives and complementary repositories for TesseractOCR
- Citizen Relationship Management Open Semantic Platform ☆10 Updated 5 years ago
- The more often you click a word in the headlines, the more interesting are your news. ☆13 Updated 7 years ago
- Algorithmic summarizer for RSS/Atom Feeds, Web Urls and arbitrary text. Codebase for the application deployed at http://tldrzr.herokuapp.… ☆53 Updated 8 years ago
- A platform for tools that do stuff with data ☆56 Updated 5 years ago
- Cuts movie dialog summary video. ☆10 Updated 8 years ago
- Collects multimedia content shared through social networks. ☆19 Updated 9 years ago
- natural language processing with link-grammar ☆18 Updated 15 years ago
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news" ☆61 Updated 3 years ago
- List of tools and Utilities for Data and Information Visualization. Ever Expanding list with Insights into some of the most happening Fra… ☆39 Updated 9 years ago
- Best CRM Software for Startups ☆54 Updated 10 years ago
- Automatic, zero-config web scraping -- written in Java, has no dependency on Java EE or app servers, and the web scraper has a restful/JS… ☆156 Updated 7 years ago
- How to spot first stories on Twitter using Storm. ☆125 Updated 11 months ago
- Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It … ☆15 Updated 8 years ago
- ☆30 Updated 12 years ago
- A small Docker built for the OCRopus OCR system. ☆19 Updated 6 years ago
- Chambua is an open-source semantic tagging application that analyses text and extracts names of people, places (& geocodes them), organis… ☆33 Updated 3 years ago
- Semanticizest: dump parser and client ☆20 Updated 8 years ago
- Blog crawler for the blogforever project. ☆22 Updated 10 years ago
- GeoReporter Android source code. Native Android smartphone client app for Open311 API civic issue reporting. ☆31 Updated 8 years ago
- Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi ☆41 Updated 14 years ago
- Sauna - a social news reader and curation tool ☆52 Updated 10 years ago
- Neddick: Open Source Information Discovery Platform ☆36 Updated last year
- An application, a library,... to control your Linux desktop with your voice ☆49 Updated 8 years ago
- The (overall) documentation of the d:swarm platform (https://github.com/dswarm/dswarm-documentation/wiki) ☆21 Updated 9 years ago
- API server for TextBlob: Sentiment analysis, POS tagging, noun phrase extraction. ☆23 Updated 9 years ago
- Discover, analyze and present data from the web and mobile in meaningful ways ☆83 Updated 11 years ago
- Human-Powered Data Analysis with Mechanical Turk ☆300 Updated 11 years ago
- ☆36 Updated last year