Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets
☆92Jan 21, 2016Updated 10 years ago
Alternatives and similar repositories for correlation-approximation
Users that are interested in correlation-approximation are comparing it to the libraries listed below
Sorting:
- A series of analytics for creating networks from geo-temporal track data based on time/space co-occurrence. Includes UI for visualizatio…☆14Aug 30, 2018Updated 7 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆17Sep 11, 2015Updated 10 years ago
- Pattern-of-Behavior Search Tool☆11Jun 20, 2022Updated 3 years ago
- MITIE: library and tools for information extraction☆29Jan 22, 2015Updated 11 years ago
- Tools for scraping of twitter data, conversion, text analysis and graph construction☆11Aug 1, 2016Updated 9 years ago
- ☆20Mar 31, 2017Updated 8 years ago
- Big GeoSpatial Data Points Visualization Tool☆19May 6, 2016Updated 9 years ago
- Quickly analyze and explore email with advanced analytics and visualization.☆56Oct 5, 2021Updated 4 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Aug 8, 2016Updated 9 years ago
- ☆44Jan 15, 2016Updated 10 years ago
- ☆25Jan 26, 2016Updated 10 years ago
- Python implementation of nonparametric nearest-neighbor-based estimators for divergences between distributions.☆48Mar 13, 2017Updated 9 years ago
- Minerva: client/server/services for analysis and visualization☆37Aug 24, 2018Updated 7 years ago
- Distributed optimization framework with parameter server☆23Jun 14, 2015Updated 10 years ago
- Distributed Graph Analytics (DGA) is a compendium of graph analytics written for Bulk-Synchronous-Parallel (BSP) processing frameworks su…☆175Jan 10, 2019Updated 7 years ago
- Topic modeling web application☆40Jul 23, 2015Updated 10 years ago
- Meta information for the DARPA open catalog project.☆57Nov 16, 2017Updated 8 years ago
- MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.☆14Apr 12, 2022Updated 3 years ago
- A tool that evolves small brains capable of scanning and classifying an image.☆14Jul 25, 2016Updated 9 years ago
- Aperture-Tiles uses familiar web-based map interactions to allow exploration of arbitrary huge data sets.☆74May 23, 2023Updated 2 years ago
- ☆46Oct 15, 2013Updated 12 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Sep 9, 2015Updated 10 years ago
- Online Multi-Class LPBoost and Gradient Boosting☆68Dec 11, 2014Updated 11 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- Convex Polytope Machine☆25Feb 4, 2015Updated 11 years ago
- C++ implementation of the Hellinger PCA for computing word embeddings.☆32Nov 11, 2016Updated 9 years ago
- General Architecture for Text Engineering☆50Mar 23, 2016Updated 9 years ago
- Neon Geo-temporal Dashboard☆14Jan 10, 2020Updated 6 years ago
- ☆21Jan 23, 2016Updated 10 years ago
- USC GoFFish Graph Analytics Framework☆33Jul 10, 2014Updated 11 years ago
- SmallK: very fast data clustering tools☆14Apr 3, 2019Updated 6 years ago
- xlvector's solution of github contest☆33Aug 30, 2009Updated 16 years ago
- TBEEF, a doubly ensemble framework for recommendation and prediction problems.☆20Apr 16, 2016Updated 9 years ago
- SNAP repository for Ringo☆14Jul 25, 2017Updated 8 years ago
- A library for time series analysis on Apache Spark☆1,198Oct 13, 2020Updated 5 years ago
- Trident State implementation on top of Elasticsearch☆21May 18, 2015Updated 10 years ago
- enable rapid iteration and development of complex data pipelines☆30Mar 9, 2025Updated last year
- Randomized online matrix factorization☆139Jun 16, 2020Updated 5 years ago
- ☆11Sep 8, 2016Updated 9 years ago