ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (images,but could be extended to other files) in place, and to extract metadata and OCR information from those files/images using Tika and Tesseract OCR.
☆95Aug 26, 2018Updated 7 years ago
Alternatives and similar repositories for imagecat
Users that are interested in imagecat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Interactive Image similarity and Visual Search and Retrieval application☆95Apr 16, 2024Updated 2 years ago
- Browser add-on and web server to support collection and analysis of web browsing data.☆14Mar 9, 2016Updated 10 years ago
- Topic modeling web application☆40Jul 23, 2015Updated 10 years ago
- Viewers for statistics and dashboarding of Domain Search Engine data☆128Jan 19, 2016Updated 10 years ago
- Faceted search engine for domain-specific exploration of the Web☆45Feb 10, 2017Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Python toolkit for pluggable algorithms and data structures for multimedia-based machine learning.☆79Jul 28, 2025Updated 9 months ago
- ☆44Jan 15, 2016Updated 10 years ago
- Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi…☆11May 11, 2015Updated 10 years ago
- ☆20Nov 1, 2017Updated 8 years ago
- General Architecture for Text Engineering☆50Mar 23, 2016Updated 10 years ago
- A distributed, parallelized (Map Reduce) wrapper around Apache RAT™ to allow it to complete on large code repositories of multiple file t…☆31Feb 4, 2020Updated 6 years ago
- MITIE: library and tools for information extraction☆29Jan 22, 2015Updated 11 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.☆34May 3, 2023Updated 3 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆17Sep 11, 2015Updated 10 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆12Apr 8, 2026Updated 3 weeks ago
- Numerous tools for text processing☆74Jul 30, 2017Updated 8 years ago
- Extract and Visualize location from any file☆55Apr 27, 2023Updated 3 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆38Apr 9, 2024Updated 2 years ago
- Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika☆14May 16, 2017Updated 8 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Feb 26, 2022Updated 4 years ago
- ☆13Nov 30, 2015Updated 10 years ago
- For extracting measurements and related entities from text☆58May 6, 2020Updated 5 years ago
- ACHE is a web crawler for domain-specific search.☆483Aug 31, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Latent dirichlet allocation (LDA) for datamicroscopes☆41Oct 16, 2015Updated 10 years ago
- A dataset downloaded from the deep and scientific web across three major Polar data centers for use in research.☆13Sep 8, 2017Updated 8 years ago
- Aperture-Tiles uses familiar web-based map interactions to allow exploration of arbitrary huge data sets.☆74May 23, 2023Updated 2 years ago
- JavaScript based graph visualization library with emphasis on customization and modularity.☆13Mar 21, 2019Updated 7 years ago
- Convert Point cloud data to Cloud Optimized GeoTIFF using AWS Lambda☆15Apr 27, 2020Updated 6 years ago
- Java library for generation and validation of software licenses (forked from OddSource/java-license-manager).☆12Nov 23, 2023Updated 2 years ago
- A suite of Machine Learning / Deep Learning Dockerfiles to allow Apache Tika to extract objects and to produce textual captions for image…☆21Jun 18, 2024Updated last year
- Polar USC activities related to NSF Polar CyberInfrastructure program at the University of Southern California☆15Jan 15, 2023Updated 3 years ago
- R files containing the code used to predict rugby world cup matches☆11Sep 18, 2015Updated 10 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Using data to dig into the 2015 NL Cy Young race☆10Nov 19, 2015Updated 10 years ago
- The Tor Path Simulator☆87Jan 16, 2017Updated 9 years ago
- Solutions for various crackmes☆20Jan 13, 2013Updated 13 years ago
- Hadoop integration code for working with with Apache cTAKES☆10Feb 11, 2014Updated 12 years ago
- Highlight and select phrases in HTML pages.☆24Nov 4, 2019Updated 6 years ago
- Mirror of Apache OODT☆64Apr 17, 2023Updated 3 years ago
- Experiments on english wikipedia. GloVe and word2vec.☆13Dec 1, 2015Updated 10 years ago