ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika, and Apache OODT to ingest tens of millions of files in place (images, though it could be extended to other file types) and to extract metadata and OCR text from them using Tika and Tesseract OCR.
☆95 · Aug 26, 2018 · Updated 7 years ago
Alternatives and similar repositories for imagecat
Users that are interested in imagecat are comparing it to the libraries listed below
- Interactive Image similarity and Visual Search and Retrieval application · ☆95 · Apr 16, 2024 · Updated last year
- Topic modeling web application · ☆40 · Jul 23, 2015 · Updated 10 years ago
- Facet Search interface for MEMEX. · ☆13 · Feb 26, 2015 · Updated 11 years ago
- Browser add-on and web server to support collection and analysis of web browsing data. · ☆14 · Mar 9, 2016 · Updated 9 years ago
- Viewers for statistics and dashboarding of Domain Search Engine data · ☆126 · Jan 19, 2016 · Updated 10 years ago
- Python toolkit for pluggable algorithms and data structures for multimedia-based machine learning. · ☆77 · Jul 28, 2025 · Updated 7 months ago
- General Architecture for Text Engineering · ☆49 · Mar 23, 2016 · Updated 9 years ago
- ☆20 · Nov 1, 2017 · Updated 8 years ago
- Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi… · ☆11 · May 11, 2015 · Updated 10 years ago
- Faceted search engine for domain-specific exploration of the Web · ☆45 · Feb 10, 2017 · Updated 9 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess… · ☆17 · Sep 11, 2015 · Updated 10 years ago
- A distributed, parallelized (Map Reduce) wrapper around Apache RAT™ to allow it to complete on large code repositories of multiple file t… · ☆31 · Feb 4, 2020 · Updated 6 years ago
- MITIE: library and tools for information extraction · ☆29 · Jan 22, 2015 · Updated 11 years ago
- ☆44 · Jan 15, 2016 · Updated 10 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text. · ☆34 · May 3, 2023 · Updated 2 years ago
- ☆10 · Jan 15, 2017 · Updated 9 years ago
- ☆13 · Nov 30, 2015 · Updated 10 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them. · ☆38 · Apr 9, 2024 · Updated last year
- JavaScript based graph visualization library with emphasis on customization and modularity. · ☆13 · Mar 21, 2019 · Updated 6 years ago
- Latent Dirichlet allocation (LDA) for datamicroscopes · ☆41 · Oct 16, 2015 · Updated 10 years ago
- Extract and Visualize location from any file · ☆55 · Apr 27, 2023 · Updated 2 years ago
- Numerous tools for text processing · ☆74 · Jul 30, 2017 · Updated 8 years ago
- Columbia Image and Face Search tool for MEMEX · ☆58 · Feb 6, 2020 · Updated 6 years ago
- Formasaurus tells you the type of an HTML form and its fields using machine learning · ☆119 · Feb 23, 2026 · Updated last week
- ACHE is a web crawler for domain-specific search. · ☆479 · Aug 31, 2025 · Updated 6 months ago
- Problem Sets for Jour72326: Scraping for Journalists. · ☆20 · May 22, 2017 · Updated 8 years ago
- Loopback web application for administration of Datawake networks · ☆10 · May 2, 2017 · Updated 8 years ago
- Fast links parser for Python & Humans · ☆11 · Dec 27, 2012 · Updated 13 years ago
- RDFSpace constructs a vector space from any RDF dataset which can be used for computing similarities between resources in that dataset. · ☆41 · Nov 8, 2013 · Updated 12 years ago
- Some code to examine and modify your experience of Twitter. · ☆11 · May 30, 2020 · Updated 5 years ago
- Using data to dig into the 2015 NL Cy Young race · ☆10 · Nov 19, 2015 · Updated 10 years ago
- Interactive visualization of non-linear logistic regression decision boundaries · ☆28 · Jul 24, 2014 · Updated 11 years ago
- Docker container to provide Apache Tika RESTful API · ☆41 · Feb 12, 2016 · Updated 10 years ago
- For FFL Blog · ☆10 · Sep 24, 2015 · Updated 10 years ago
- A web application that recommends songs via "country arithmetic" and hand-rolled Implicit Matrix Factorization · ☆10 · May 5, 2017 · Updated 8 years ago
- Large RDF hierarchies as vector spaces · ☆20 · Jun 27, 2014 · Updated 11 years ago
- Vizlinc · ☆15 · Jan 14, 2016 · Updated 10 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamedEntityParser · ☆13 · Feb 26, 2022 · Updated 4 years ago
- WikiLeaks Cablegate Reference Network Visualization: cables.csv to graph to svg/html5 · ☆29 · Apr 20, 2014 · Updated 11 years ago