Interactive Audience Analytics with Spark and HyperLogLog
☆55Oct 14, 2015Updated 10 years ago
Alternatives and similar repositories for spark-hyperloglog
Users that are interested in spark-hyperloglog are comparing it to the libraries listed below
Sorting:
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- just some scripts that I use☆27Dec 19, 2012Updated 13 years ago
- Experiments with the GDELT dataset and Cassandra schemas.☆25Feb 9, 2016Updated 10 years ago
- Coursera Machine Learning class examples in Spark☆43Feb 14, 2014Updated 12 years ago
- Examples for Fast Data Processing with Spark☆59Sep 10, 2013Updated 12 years ago
- Locality Sensitive Hashing for Apache Spark☆197Nov 1, 2016Updated 9 years ago
- Decorators/State & View Models for Ember.js applications☆11Sep 9, 2016Updated 9 years ago
- ☆11Oct 8, 2015Updated 10 years ago
- Application that visualizes your google location history in form of a heatmap using Spark to aggregate the data.☆12Feb 19, 2015Updated 11 years ago
- Joins for skewed datasets in Spark☆57Aug 18, 2017Updated 8 years ago
- Collection of Interesting Algorithms☆16Oct 13, 2020Updated 5 years ago
- Sample of resteasy-netty project☆17Jun 25, 2015Updated 10 years ago
- Example demonstrating a Scala project that builds using Gradle, produces a shadow jar suitable for spark-submit, and has tests using Scal…☆18Jun 18, 2015Updated 10 years ago
- A simple implementation of k-means clustering on the Spark cluster computing framework. See http://cs.berkeley.edu/~matei/spark.☆27Apr 9, 2011Updated 14 years ago
- Elastic Sentiment Analysis (using Apache Mesos, Marathon and Apache Spark)☆35Mar 16, 2015Updated 10 years ago
- Dockerfile for Apache Zeppelin☆17Dec 9, 2015Updated 10 years ago
- A tool for running Spark on Google Compute Engine☆16Jan 20, 2017Updated 9 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Jun 22, 2014Updated 11 years ago
- Coding exercises for Apache Spark☆104Jun 4, 2015Updated 10 years ago
- Amazon access control challenge☆25Jun 21, 2014Updated 11 years ago
- something to help you spark☆19Jul 5, 2017Updated 8 years ago
- Code for the Kaggle acquire valued shoppers challenge☆66Apr 17, 2014Updated 11 years ago
- On demand presto cluster with mesos, marathon and docker.☆29Mar 7, 2018Updated 7 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Feb 1, 2018Updated 8 years ago
- ☆21Oct 1, 2015Updated 10 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Oct 27, 2016Updated 9 years ago
- Secondary sort and streaming reduce for Apache Spark☆78Jul 3, 2023Updated 2 years ago
- ☆23Jun 18, 2017Updated 8 years ago
- Spark Scala docker container sample for AWS testing - EKS & S3☆24Oct 4, 2018Updated 7 years ago
- Spark example of collecting tweets and loading into HDFS/S3☆42Oct 2, 2013Updated 12 years ago
- Big Spatial Data Processing using Spark☆146Mar 7, 2017Updated 8 years ago
- Abstract Algebra for Scala☆2,301Nov 21, 2025Updated 3 months ago
- A SBT resolver and publisher for Google Cloud Storage☆23Dec 15, 2021Updated 4 years ago
- A neural network library which trained by Spark RDD instances.☆22Jan 5, 2016Updated 10 years ago
- Benchmarks for Kaggle's Predict Closed Questions on Stack Overflow competition☆55Mar 19, 2016Updated 9 years ago
- This description to be completed later☆10Oct 7, 2018Updated 7 years ago
- A tool for translating Scala source code into readable and maintainable Java code☆13Jan 3, 2026Updated 2 months ago
- Few things we've met during our etl project based on spark☆24Mar 22, 2018Updated 7 years ago
- Sparse feature extraction with Spark☆30Jul 25, 2018Updated 7 years ago