andrewdefries / TesseractOCR
Full-text extraction using the open-source Tesseract OCR software (https://code.google.com/p/tesseract-ocr/) and ImageMagick
☆12 Updated 9 years ago
Alternatives and similar repositories for TesseractOCR:
Users interested in TesseractOCR are comparing it to the repositories listed below.
- Agent-based SImulation, MOdeling, and Visualization of processes ☆15 Updated 9 years ago
- Lexical categorization engine for large datasets. Good for NLP and Data Mining. ☆104 Updated 8 years ago
- System for mining Wikipedia Usage data to read our collective mind ☆21 Updated 10 years ago
- Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It … ☆15 Updated 8 years ago
- The first Open Source document analysis platform ☆65 Updated 3 years ago
- A small Docker built for the OCRopus OCR system. ☆19 Updated 7 years ago
- Collects multimedia content shared through social networks. ☆19 Updated 9 years ago
- Discover, analyze and present data from the web and mobile in meaningful ways ☆83 Updated 11 years ago
- A way to build and explore webs of ideas. ☆15 Updated 8 years ago
- ☆13 Updated 9 years ago
- Chambua is an open-source semantic tagging application that analyses text and extracts names of people, places (& geocodes them), organis… ☆33 Updated 3 years ago
- GeoReporter Android source code. Native Android smartphone client app for Open311 API civic issue reporting. ☆31 Updated 9 years ago
- Serapis is a sentence identifier and modeling pipeline / built for Wordnik ☆24 Updated 8 years ago
- Various Python scripts to scrape sites that store data about you. ☆28 Updated 11 years ago
- Neddick: Open Source Information Discovery Platform ☆36 Updated last year
- Visualization Storytelling Components ☆32 Updated 10 years ago
- A platform for tools that do stuff with data ☆56 Updated 5 years ago
- A JavaScript weather demo for SpreadsheetDB ☆14 Updated 7 years ago
- Blog crawler for the blogforever project. ☆22 Updated 10 years ago
- Citizen Relationship Management Open Semantic Platform ☆10 Updated 5 years ago
- A cypher browser based on sigmajs ☆17 Updated 9 years ago
- A sample app that combines geolocated entities from Freebase with Maps API ☆41 Updated 10 years ago
- LIBRE = Libre Information Batch Restructuring Engine. ☆70 Updated 10 years ago
- This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading … ☆17 Updated 11 months ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas… ☆12 Updated 10 years ago