chrismattmann/imagecat

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chrismattmann/imagecat)

chrismattmann / imagecat

ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (images,but could be extended to other files) in place, and to extract metadata and OCR information from those files/images using Tika and Tesseract OCR.

☆96

Alternatives and similar repositories for imagecat

Users that are interested in imagecat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nasa-jpl-memex / image_space
View on GitHub
Interactive Image similarity and Visual Search and Retrieval application
☆95Apr 16, 2024Updated 2 years ago
Sotera / Datawake
View on GitHub
Browser add-on and web server to support collection and analysis of web browsing data.
☆14Mar 9, 2016Updated 10 years ago
nasa-jpl-memex / topic_space
View on GitHub
Topic modeling web application
☆40Jul 23, 2015Updated 11 years ago
pymonger / facetview-memex
View on GitHub
Facet Search interface for MEMEX.
☆13Feb 26, 2015Updated 11 years ago
nasa-jpl-memex / memex-explorer
View on GitHub
Viewers for statistics and dashboarding of Domain Search Engine data
☆128Jan 19, 2016Updated 10 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
NextCenturyCorporation / dig
View on GitHub
Faceted search engine for domain-specific exploration of the Web
☆45Feb 10, 2017Updated 9 years ago
Kitware / SMQTK
View on GitHub
Python toolkit for pluggable algorithms and data structures for multimedia-based machine learning.
☆79Jul 28, 2025Updated 11 months ago
mitll / topic-clustering
View on GitHub
☆44Jan 15, 2016Updated 10 years ago
germank / compatibility-naacl2015
View on GitHub
Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi…
☆11May 11, 2015Updated 11 years ago
mille856 / CMU_memex
View on GitHub
☆20Nov 1, 2017Updated 8 years ago
apache / drat
View on GitHub
A distributed, parallelized (Map Reduce) wrapper around Apache RAT™ to allow it to complete on large code repositories of multiple file t…
☆31Feb 4, 2020Updated 6 years ago
USCDataScience / SentimentAnalysisParser
View on GitHub
Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.
☆34May 3, 2023Updated 3 years ago
nasa-jpl-memex / elwha
View on GitHub
Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…
☆17Sep 11, 2015Updated 10 years ago
TeamHG-Memex / autologin
View on GitHub
A project to attempt to automatically login to a website given a single seed
☆129Apr 8, 2026Updated 3 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
TeamHG-Memex / scrapy-dockerhub
View on GitHub
[UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.
☆12Apr 8, 2026Updated 3 months ago
nasa-jpl-memex / GeoParser
View on GitHub
Extract and Visualize location from any file
☆55Apr 27, 2023Updated 3 years ago
thammegowda / tika-ner-corenlp
View on GitHub
Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser
☆13Feb 26, 2022Updated 4 years ago
chrismattmann / lucene-geo-gazetteer
View on GitHub
Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.
☆38Jun 5, 2026Updated last month
sisiwei / 2015-spring-cuny-web-scraping
View on GitHub
Problem Sets for Jour72326: Scraping for Journalists.
☆20May 22, 2017Updated 9 years ago
coristig / sharks
View on GitHub
☆10Jan 15, 2017Updated 9 years ago
VIDA-NYU / memex
View on GitHub
☆13Nov 30, 2015Updated 10 years ago
TeamHG-Memex / Formasaurus
View on GitHub
Formasaurus tells you the type of an HTML form and its fields using machine learning
☆121Apr 8, 2026Updated 3 months ago
datamicroscopes / lda
View on GitHub
Latent dirichlet allocation (LDA) for datamicroscopes
☆41Oct 16, 2015Updated 10 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
USCDataScience / NLTKRest
View on GitHub
This is a REST Server endpoint built using Flask and Python.
☆24Nov 16, 2022Updated 3 years ago
VIDA-NYU / ache
View on GitHub
ACHE is a web crawler for domain-specific search.
☆485Aug 31, 2025Updated 10 months ago
generalmilk / DeepSentiBank
View on GitHub
deep version SentiBank
☆12Dec 16, 2014Updated 11 years ago
unchartedsoftware / aperture-tiles
View on GitHub
Aperture-Tiles uses familiar web-based map interactions to allow exploration of arbitrary huge data sets.
☆75May 23, 2023Updated 3 years ago
weblyzard / graphyte
View on GitHub
JavaScript based graph visualization library with emphasis on customization and modularity.
☆13Mar 21, 2019Updated 7 years ago
USCDataScience / tika-dockers
View on GitHub
A suite of Machine Learning / Deep Learning Dockerfiles to allow Apache Tika to extract objects and to produce textual captions for image…
☆21Jun 18, 2024Updated 2 years ago
cavaunpeu / dotify
View on GitHub
A web application that recommends songs via "country arithmetic" and hand-rolled Implicit Matrix Factorization
☆10May 5, 2017Updated 9 years ago
mitll / vizlinc
View on GitHub
Vizlinc
☆15Jan 14, 2016Updated 10 years ago
gjreda / cy-young-NL-2015
View on GitHub
Using data to dig into the 2015 NL Cy Young race
☆10Nov 19, 2015Updated 10 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
OpenNewsLabs / centipede
View on GitHub
Service-based pipelines for document processing
☆17Nov 9, 2014Updated 11 years ago
torps / torps
View on GitHub
The Tor Path Simulator
☆86Jan 16, 2017Updated 9 years ago
ericwhyne / darpa_open_catalog
View on GitHub
Meta information for the DARPA open catalog project.
☆57Nov 16, 2017Updated 8 years ago
apache / oodt
View on GitHub
Mirror of Apache OODT
☆65Apr 17, 2023Updated 3 years ago
pcodding / hadoop_ctakes
View on GitHub
Hadoop integration code for working with with Apache cTAKES
☆10Feb 11, 2014Updated 12 years ago
dossier / html-highlighter
View on GitHub
Highlight and select phrases in HTML pages.
☆24Nov 4, 2019Updated 6 years ago
dselivanov / word_embeddings
View on GitHub
Experiments on english wikipedia. GloVe and word2vec.
☆13Dec 1, 2015Updated 10 years ago