thammegowda / tika-dl4j-spark-imgrec
Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika
☆14Updated 7 years ago
Alternatives and similar repositories for tika-dl4j-spark-imgrec:
Users that are interested in tika-dl4j-spark-imgrec are comparing it to the libraries listed below
- ☆20Updated 8 years ago
- Movielens collaborative filtering with Solr streaming expression☆11Updated 8 years ago
- A Spark-based LexRank extractive summarizer for text documents☆19Updated 9 years ago
- Base components for Question Answering pipelines☆28Updated 2 years ago
- ☆11Updated 8 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆36Updated 9 months ago
- An open relation extraction system☆46Updated 3 years ago
- Distributed implementation of Robust PLSA using Spark☆12Updated 3 years ago
- General Vectorization Lib for Machine Learning Tools☆31Updated 8 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 12 years ago
- Mention-anomaly-based event detection and tracking in Twitter☆17Updated 8 years ago
- MinorThird is a collection of Java classes for storing text, annotating text, and learning to extract entities and categorize text.☆57Updated 6 years ago
- Annotated Gigaword Java API and Command Line Tools☆15Updated 8 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 9 years ago
- Tools for building a Lucene index for Semantic Vectors☆21Updated 9 years ago
- Tweet Analysis with Spark☆15Updated 7 years ago
- Python and Scala APIs for enhanced Spark analytics☆12Updated 7 years ago
- Implementation of the Chinese Whispers graph clustering algorithm☆8Updated 7 years ago
- A RankLib based Solr Learning to Rank Plugin☆29Updated 2 years ago
- Clone version of LingPipe 4.1.0, with support for unsupervised training☆32Updated 11 years ago
- A set of methods for automatically detecting trending topics in streams of short texts (e.g. tweets).☆52Updated 10 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.☆33Updated last year
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 3 years ago
- Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that…☆32Updated 9 years ago
- Query Expansion using word2vec☆11Updated 5 years ago
- ☆25Updated 6 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- Implicit relation extractor using a natural language model.☆25Updated 6 years ago
- CubeQA—Question Answering on Statistical Linked Data☆20Updated last year