ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (images,but could be extended to other files) in place, and to extract metadata and OCR information from those files/images using Tika and Tesseract OCR.
☆95Aug 26, 2018Updated 7 years ago
Alternatives and similar repositories for imagecat
Users that are interested in imagecat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Interactive Image similarity and Visual Search and Retrieval application☆95Apr 16, 2024Updated last year
- Browser add-on and web server to support collection and analysis of web browsing data.☆14Mar 9, 2016Updated 10 years ago
- Topic modeling web application☆40Jul 23, 2015Updated 10 years ago
- Viewers for statistics and dashboarding of Domain Search Engine data☆128Jan 19, 2016Updated 10 years ago
- Python toolkit for pluggable algorithms and data structures for multimedia-based machine learning.☆78Jul 28, 2025Updated 8 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆44Jan 15, 2016Updated 10 years ago
- Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi…☆11May 11, 2015Updated 10 years ago
- ☆20Nov 1, 2017Updated 8 years ago
- General Architecture for Text Engineering☆50Mar 23, 2016Updated 10 years ago
- A distributed, parallelized (Map Reduce) wrapper around Apache RAT™ to allow it to complete on large code repositories of multiple file t…☆31Feb 4, 2020Updated 6 years ago
- MITIE: library and tools for information extraction☆29Jan 22, 2015Updated 11 years ago
- A project to attempt to automatically login to a website given a single seed☆129Updated this week
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆12Updated this week
- Numerous tools for text processing☆74Jul 30, 2017Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Extract and Visualize location from any file☆55Apr 27, 2023Updated 2 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆38Apr 9, 2024Updated 2 years ago
- Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika☆14May 16, 2017Updated 8 years ago
- Problem Sets for Jour72326: Scraping for Journalists.☆20May 22, 2017Updated 8 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Feb 26, 2022Updated 4 years ago
- ☆10Jan 15, 2017Updated 9 years ago
- ☆13Nov 30, 2015Updated 10 years ago
- Columbia Image and Face Search tool for MEMEX☆58Feb 6, 2020Updated 6 years ago
- For extracting measurements and related entities from text☆58May 6, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ACHE is a web crawler for domain-specific search.☆483Aug 31, 2025Updated 7 months ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆41Oct 16, 2015Updated 10 years ago
- deep version SentiBank☆12Dec 16, 2014Updated 11 years ago
- Aperture-Tiles uses familiar web-based map interactions to allow exploration of arbitrary huge data sets.☆74May 23, 2023Updated 2 years ago
- JavaScript based graph visualization library with emphasis on customization and modularity.☆13Mar 21, 2019Updated 7 years ago
- Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.☆423Mar 30, 2023Updated 3 years ago
- Convert Point cloud data to Cloud Optimized GeoTIFF using AWS Lambda☆15Apr 27, 2020Updated 5 years ago
- Java library for generation and validation of software licenses (forked from OddSource/java-license-manager).☆12Nov 23, 2023Updated 2 years ago
- A suite of Machine Learning / Deep Learning Dockerfiles to allow Apache Tika to extract objects and to produce textual captions for image…☆21Jun 18, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Polar USC activities related to NSF Polar CyberInfrastructure program at the University of Southern California☆15Jan 15, 2023Updated 3 years ago
- R files containing the code used to predict rugby world cup matches☆10Sep 18, 2015Updated 10 years ago
- A web application that recommends songs via "country arithmetic" and hand-rolled Implicit Matrix Factorization☆10May 5, 2017Updated 8 years ago
- Using data to dig into the 2015 NL Cy Young race☆10Nov 19, 2015Updated 10 years ago
- Vizlinc☆15Jan 14, 2016Updated 10 years ago
- The Tor Path Simulator☆87Jan 16, 2017Updated 9 years ago
- Solutions for various crackmes☆20Jan 13, 2013Updated 13 years ago