Sotera/correlation-approximation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Sotera/correlation-approximation)

Sotera / correlation-approximation

Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets

☆91

Alternatives and similar repositories for correlation-approximation

Users that are interested in correlation-approximation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Sotera / track-communities
View on GitHub
A series of analytics for creating networks from geo-temporal track data based on time/space co-occurrence. Includes UI for visualizatio…
☆14Aug 30, 2018Updated 7 years ago
nasa-jpl-memex / elwha
View on GitHub
Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…
☆17Sep 11, 2015Updated 10 years ago
mitll / graph-qube
View on GitHub
Pattern-of-Behavior Search Tool
☆11Jun 20, 2022Updated 4 years ago
ContinuumIO / scrapy_scrapers
View on GitHub
Scraper built with Scrapy.
☆18Jun 25, 2026Updated 3 weeks ago
mitll / MITIE
View on GitHub
MITIE: library and tools for information extraction
☆29Jan 22, 2015Updated 11 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mitll / TweetE
View on GitHub
Tools for scraping of twitter data, conversion, text analysis and graph construction
☆11Aug 1, 2016Updated 9 years ago
autonlab / ActiveSearch
View on GitHub
☆20Mar 31, 2017Updated 9 years ago
MBoustani / Khooshe
View on GitHub
Big GeoSpatial Data Points Visualization Tool
☆19May 6, 2016Updated 10 years ago
Sotera / Datawake
View on GitHub
Browser add-on and web server to support collection and analysis of web browsing data.
☆14Mar 9, 2016Updated 10 years ago
mitll / vizlinc
View on GitHub
Vizlinc
☆15Jan 14, 2016Updated 10 years ago
Sotera / newman
View on GitHub
Quickly analyze and explore email with advanced analytics and visualization.
☆56Oct 5, 2021Updated 4 years ago
djsutherland / skl-groups
View on GitHub
scikit-learn addon to operate on set/"group"-based features
☆41Aug 8, 2016Updated 9 years ago
tensorlib / tensorlib
View on GitHub
Yet another tensor library
☆23Mar 29, 2017Updated 9 years ago
mitll / topic-clustering
View on GitHub
☆44Jan 15, 2016Updated 10 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
danielpreotiuc / twitter-collection-utils
View on GitHub
Twitter data collection scripts
☆15Apr 19, 2016Updated 10 years ago
Sotera / distributed-graph-analytics
View on GitHub
Distributed Graph Analytics (DGA) is a compendium of graph analytics written for Bulk-Synchronous-Parallel (BSP) processing frameworks su…
☆177Jan 10, 2019Updated 7 years ago
ericwhyne / darpa_open_catalog
View on GitHub
Meta information for the DARPA open catalog project.
☆57Nov 16, 2017Updated 8 years ago
plamenbbn / XDATA
View on GitHub
PINT Algorithm for XDATA
☆21Nov 29, 2016Updated 9 years ago
rhiever / active-categorical-classifier
View on GitHub
A tool that evolves small brains capable of scanning and classifying an image.
☆14Jul 25, 2016Updated 9 years ago
mcapuccini / MaRe
View on GitHub
MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.
☆14Apr 12, 2022Updated 4 years ago
alkant / cpm
View on GitHub
Convex Polytope Machine
☆25Feb 4, 2015Updated 11 years ago
unchartedsoftware / aperture-tiles
View on GitHub
Aperture-Tiles uses familiar web-based map interactions to allow exploration of arbitrary huge data sets.
☆75May 23, 2023Updated 3 years ago
brkyvz / lazy-linalg
View on GitHub
A package full of linear algebra operators for Apache Spark MLlib's linalg package
☆10Sep 9, 2015Updated 10 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
amirsaffari / online-multiclass-lpboost
View on GitHub
Online Multi-Class LPBoost and Gradient Boosting
☆68Dec 11, 2014Updated 11 years ago
rlebret / hpca
View on GitHub
C++ implementation of the Hellinger PCA for computing word embeddings.
☆32Nov 11, 2016Updated 9 years ago
nasa-jpl-memex / memex-gate
View on GitHub
General Architecture for Text Engineering
☆50Mar 23, 2016Updated 10 years ago
NextCenturyCorporation / neon-gtd
View on GitHub
Neon Geo-temporal Dashboard
☆14Jan 10, 2020Updated 6 years ago
amplab / keystone
View on GitHub
Simplifying robust end-to-end machine learning on Apache Spark.
☆473Apr 18, 2017Updated 9 years ago
smallk / smallk.github.io
View on GitHub
SmallK: very fast data clustering tools
☆13Apr 3, 2019Updated 7 years ago
snap-stanford / snap-dev
View on GitHub
SNAP repository for Ringo
☆15Jul 25, 2017Updated 8 years ago
ermine-language / ermine-legacy
View on GitHub
DEFUNCT: use https://bitbucket.org/ermine-language/ermine-scala ; see README
☆20May 2, 2015Updated 11 years ago
fated / libcp
View on GitHub
LibCP -- A Library for Conformal Prediction
☆13Feb 26, 2015Updated 11 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
redpony / creg
View on GitHub
Fast regression modeling framework
☆23Aug 2, 2018Updated 7 years ago
ChrisRackauckas / TBEEF
View on GitHub
TBEEF, a doubly ensemble framework for recommendation and prediction problems.
☆20Apr 16, 2016Updated 10 years ago
SmileWide / main
View on GitHub
The main - so far, only - repository for the SmileWide project.
☆32Mar 23, 2016Updated 10 years ago
npinto / asgd
View on GitHub
Averaged Stochastic Gradient Descent Classifiers
☆42Jul 6, 2012Updated 14 years ago
arthurmensch / modl
View on GitHub
Randomized online matrix factorization
☆140Jun 16, 2020Updated 6 years ago
sckangz / recom_mc
View on GitHub
☆11Sep 8, 2016Updated 9 years ago
abietti / stochs
View on GitHub
stochs: fast stochastic solvers for machine learning in C++ and Cython
☆27Oct 13, 2022Updated 3 years ago