ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (images,but could be extended to other files) in place, and to extract metadata and OCR information from those files/images using Tika and Tesseract OCR.
☆95Aug 26, 2018Updated 7 years ago
Alternatives and similar repositories for imagecat
Users that are interested in imagecat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Interactive Image similarity and Visual Search and Retrieval application☆95Apr 16, 2024Updated last year
- Facet Search interface for MEMEX.☆13Feb 26, 2015Updated 11 years ago
- Browser add-on and web server to support collection and analysis of web browsing data.☆14Mar 9, 2016Updated 10 years ago
- Topic modeling web application☆40Jul 23, 2015Updated 10 years ago
- Viewers for statistics and dashboarding of Domain Search Engine data☆127Jan 19, 2016Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Faceted search engine for domain-specific exploration of the Web☆45Feb 10, 2017Updated 9 years ago
- Python toolkit for pluggable algorithms and data structures for multimedia-based machine learning.☆77Jul 28, 2025Updated 7 months ago
- ☆44Jan 15, 2016Updated 10 years ago
- Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi…☆11May 11, 2015Updated 10 years ago
- ☆20Nov 1, 2017Updated 8 years ago
- General Architecture for Text Engineering☆50Mar 23, 2016Updated 10 years ago
- A distributed, parallelized (Map Reduce) wrapper around Apache RAT™ to allow it to complete on large code repositories of multiple file t…☆31Feb 4, 2020Updated 6 years ago
- MITIE: library and tools for information extraction☆29Jan 22, 2015Updated 11 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆17Sep 11, 2015Updated 10 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A project to attempt to automatically login to a website given a single seed☆129Updated this week
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆12Feb 23, 2026Updated last month
- Numerous tools for text processing☆74Jul 30, 2017Updated 8 years ago
- Extract and Visualize location from any file☆55Apr 27, 2023Updated 2 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆38Apr 9, 2024Updated last year
- Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika☆14May 16, 2017Updated 8 years ago
- Problem Sets for Jour72326: Scraping for Journalists.☆20May 22, 2017Updated 8 years ago
- Formasaurus tells you the type of an HTML form and its fields using machine learning☆121Updated this week
- Columbia Image and Face Search tool for MEMEX☆58Feb 6, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- For extracting measurements and related entities from text☆58May 6, 2020Updated 5 years ago
- ACHE is a web crawler for domain-specific search.☆479Aug 31, 2025Updated 6 months ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆41Oct 16, 2015Updated 10 years ago
- deep version SentiBank☆12Dec 16, 2014Updated 11 years ago
- Aperture-Tiles uses familiar web-based map interactions to allow exploration of arbitrary huge data sets.☆74May 23, 2023Updated 2 years ago
- JavaScript based graph visualization library with emphasis on customization and modularity.☆13Mar 21, 2019Updated 7 years ago
- Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.☆420Mar 30, 2023Updated 2 years ago
- Java library for generation and validation of software licenses (forked from OddSource/java-license-manager).☆12Nov 23, 2023Updated 2 years ago
- A suite of Machine Learning / Deep Learning Dockerfiles to allow Apache Tika to extract objects and to produce textual captions for image…☆21Jun 18, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A port of parts of Python's itertools (and maybe stuff from more-itertools, etc.) to Swift☆17Feb 17, 2016Updated 10 years ago
- R files containing the code used to predict rugby world cup matches☆10Sep 18, 2015Updated 10 years ago
- A web application that recommends songs via "country arithmetic" and hand-rolled Implicit Matrix Factorization☆10May 5, 2017Updated 8 years ago
- Using data to dig into the 2015 NL Cy Young race☆10Nov 19, 2015Updated 10 years ago
- Vizlinc☆15Jan 14, 2016Updated 10 years ago
- Meta information for the DARPA open catalog project.☆57Nov 16, 2017Updated 8 years ago
- MEMEX Weapons Pilot for the illegal weapons domain.☆15May 20, 2016Updated 9 years ago