ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (images,but could be extended to other files) in place, and to extract metadata and OCR information from those files/images using Tika and Tesseract OCR.
☆96Aug 26, 2018Updated 7 years ago
Alternatives and similar repositories for imagecat
Users that are interested in imagecat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Facet Search interface for MEMEX.☆13Feb 26, 2015Updated 11 years ago
- Browser add-on and web server to support collection and analysis of web browsing data.☆14Mar 9, 2016Updated 10 years ago
- Topic modeling web application☆40Jul 23, 2015Updated 10 years ago
- Viewers for statistics and dashboarding of Domain Search Engine data☆128Jan 19, 2016Updated 10 years ago
- Faceted search engine for domain-specific exploration of the Web☆45Feb 10, 2017Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Python toolkit for pluggable algorithms and data structures for multimedia-based machine learning.☆79Jul 28, 2025Updated 11 months ago
- ☆44Jan 15, 2016Updated 10 years ago
- Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi…☆11May 11, 2015Updated 11 years ago
- ☆20Nov 1, 2017Updated 8 years ago
- General Architecture for Text Engineering☆50Mar 23, 2016Updated 10 years ago
- A distributed, parallelized (Map Reduce) wrapper around Apache RAT™ to allow it to complete on large code repositories of multiple file t…☆31Feb 4, 2020Updated 6 years ago
- MITIE: library and tools for information extraction☆29Jan 22, 2015Updated 11 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.☆34May 3, 2023Updated 3 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆17Sep 11, 2015Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A project to attempt to automatically login to a website given a single seed☆129Apr 8, 2026Updated 2 months ago
- Numerous tools for text processing☆74Jul 30, 2017Updated 8 years ago
- Extract and Visualize location from any file☆55Apr 27, 2023Updated 3 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆38Jun 5, 2026Updated last month
- Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika☆14May 16, 2017Updated 9 years ago
- Problem Sets for Jour72326: Scraping for Journalists.☆20May 22, 2017Updated 9 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Feb 26, 2022Updated 4 years ago
- ☆10Jan 15, 2017Updated 9 years ago
- ☆13Nov 30, 2015Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Formasaurus tells you the type of an HTML form and its fields using machine learning☆121Apr 8, 2026Updated 2 months ago
- ACHE is a web crawler for domain-specific search.☆484Aug 31, 2025Updated 10 months ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆41Oct 16, 2015Updated 10 years ago
- deep version SentiBank☆12Dec 16, 2014Updated 11 years ago
- Aperture-Tiles uses familiar web-based map interactions to allow exploration of arbitrary huge data sets.☆75May 23, 2023Updated 3 years ago
- JavaScript based graph visualization library with emphasis on customization and modularity.☆13Mar 21, 2019Updated 7 years ago
- Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.☆422Mar 30, 2023Updated 3 years ago
- Java library for generation and validation of software licenses (forked from OddSource/java-license-manager).☆12Nov 23, 2023Updated 2 years ago
- A suite of Machine Learning / Deep Learning Dockerfiles to allow Apache Tika to extract objects and to produce textual captions for image…☆21Jun 18, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- R files containing the code used to predict rugby world cup matches☆11Sep 18, 2015Updated 10 years ago
- Using data to dig into the 2015 NL Cy Young race☆10Nov 19, 2015Updated 10 years ago
- Vizlinc☆15Jan 14, 2016Updated 10 years ago
- Meta information for the DARPA open catalog project.☆57Nov 16, 2017Updated 8 years ago
- MEMEX Weapons Pilot for the illegal weapons domain.☆15May 20, 2016Updated 10 years ago
- Hadoop integration code for working with with Apache cTAKES☆10Feb 11, 2014Updated 12 years ago
- Highlight and select phrases in HTML pages.☆24Nov 4, 2019Updated 6 years ago