chrismattmann / imagecatLinks
ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (images,but could be extended to other files) in place, and to extract metadata and OCR information from those files/images using Tika and Tesseract OCR.
☆96Updated 6 years ago
Alternatives and similar repositories for imagecat
Users that are interested in imagecat are comparing it to the libraries listed below
Sorting:
- General Architecture for Text Engineering☆50Updated 9 years ago
- Topic modeling web application☆41Updated 10 years ago
- Viewers for statistics and dashboarding of Domain Search Engine data☆124Updated 9 years ago
- Interactive Image similarity and Visual Search and Retrieval application☆96Updated last year
- Browser add-on and web server to support collection and analysis of web browsing data.☆13Updated 9 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆17Updated 9 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆38Updated last year
- ☆44Updated 9 years ago
- Quickly analyze and explore email with advanced analytics and visualization.☆56Updated 3 years ago
- Aperture-Tiles uses familiar web-based map interactions to allow exploration of arbitrary huge data sets.☆74Updated 2 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Updated 3 years ago
- Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.☆108Updated 4 months ago
- ☆20Updated 7 years ago
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool☆149Updated 3 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 9 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.☆34Updated 2 years ago
- Simple search results with Solr and EmberJS☆58Updated 6 years ago
- Geographic Place, Date/time, and Pattern entity extraction toolkit along with text extraction from unstructured data and GIS outputters.☆45Updated this week
- Open source large document set visualization platform☆270Updated 2 years ago
- Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.☆79Updated 2 years ago
- open source big data integration, analytics, and visualization☆420Updated 8 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Updated 8 years ago
- Python toolkit for pluggable algorithms and data structures for multimedia-based machine learning.☆80Updated 3 weeks ago
- Meta information for the DARPA open catalog project.☆56Updated 7 years ago
- Mirror of Apache Stanbol (incubating)☆114Updated last year
- Pipeline for distributed Natural Language Processing, made in Python☆65Updated 8 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- The WikiBrain Java library enables researchers and developers to incorporate state-of-the-art Wikipedia-based algorithms and technologies…☆95Updated 7 years ago
- A web based data mining workflow platform with real-time analysis capabilities☆49Updated 2 years ago
- Pattern-of-Behavior Search Tool☆11Updated 3 years ago