chrismattmann / imagecat
ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (images,but could be extended to other files) in place, and to extract metadata and OCR information from those files/images using Tika and Tesseract OCR.
☆95Updated 6 years ago
Alternatives and similar repositories for imagecat:
Users that are interested in imagecat are comparing it to the libraries listed below
- Topic modeling web application☆40Updated 9 years ago
- Viewers for statistics and dashboarding of Domain Search Engine data☆122Updated 9 years ago
- Faceted search engine for domain-specific exploration of the Web☆45Updated 8 years ago
- Interactive Image similarity and Visual Search and Retrieval application☆95Updated 10 months ago
- ☆43Updated 9 years ago
- MITIE: library and tools for information extraction☆29Updated 10 years ago
- Aperture-Tiles uses familiar web-based map interactions to allow exploration of arbitrary huge data sets.☆74Updated last year
- Browser add-on and web server to support collection and analysis of web browsing data.☆13Updated 8 years ago
- ☆20Updated 7 years ago
- General Architecture for Text Engineering☆48Updated 8 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆16Updated 9 years ago
- Quickly analyze and explore email with advanced analytics and visualization.☆56Updated 3 years ago
- Facet Search interface for MEMEX.☆13Updated 9 years ago
- Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.☆108Updated 10 months ago
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆11Updated 9 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Updated 2 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆36Updated 10 months ago
- Tools for iterative knowledge base development with DeepDive☆118Updated 6 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.☆33Updated last year
- Open source large document set visualization platform☆268Updated 2 years ago
- DARPA MEMEX project Vagrant VM☆53Updated 8 years ago
- A POC at replicating Facebook Graph Search with Cypher and Neo4j☆102Updated 11 years ago
- Stanford Pattern-based Information Extraction and Diagnostics -- Visualization☆93Updated 10 years ago
- Launch AWS Elastic MapReduce jobs that process Common Crawl data.☆49Updated 8 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 3 years ago
- Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.☆79Updated last year
- A toolkit for clustering web pages based on various similarity measures.☆33Updated 3 years ago
- People. Places. Things. Graphs.☆92Updated 10 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago
- Pipeline for distributed Natural Language Processing, made in Python☆65Updated 8 years ago