USCDataScience / hadoop-pot
A scalable Apache Hadoop-based implementation of the Pooled Time Series video similarity algorithm based on M. Ryoo et al paper CVPR 2015.
☆10Updated 7 years ago
Alternatives and similar repositories for hadoop-pot
Users that are interested in hadoop-pot are comparing it to the libraries listed below
Sorting:
- CSCI-544 Final Project☆9Updated 9 years ago
- Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika☆14Updated 8 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆37Updated last year
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Updated 8 years ago
- Apache OpenNLP Sandbox☆43Updated this week
- Python functions for popular relevance metrics (ndcg, err, etc)☆16Updated last year
- This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading …☆17Updated last year
- Base components for Question Answering pipelines☆28Updated 2 years ago
- Social Context Analysis aNd Emotion Recognition☆12Updated 7 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Updated 3 years ago
- Vizlinc☆14Updated 9 years ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 9 years ago
- Advanced desktop search/corpus exploration prototype☆21Updated 3 years ago
- Python and Scala APIs for enhanced Spark analytics☆12Updated 8 years ago
- Earth Science Knowledge Graph - An Automatic Approach to Building Earth Science Knowledge Graph to Improve Data Discovery☆20Updated 3 years ago
- NYAN is a news filtering engine written in Python and some Ruby.☆15Updated last year
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 3 years ago
- ☆22Updated last year
- Training Tesseract to better extract serial numbers from images of electronic items☆9Updated 8 years ago
- ☆18Updated 7 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆16Updated 9 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 6 years ago
- ☆16Updated 8 years ago
- Implementation of the Chinese Whispers graph clustering algorithm☆8Updated 7 years ago
- Web-based synthesis of nifty NLP and entity extraction services☆13Updated 5 years ago
- Movielens collaborative filtering with Solr streaming expression☆11Updated 8 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 12 years ago
- ☆20Updated 8 years ago
- Extract statistics from Wikipedia Dump files.☆26Updated 3 years ago