Interactive Audience Analytics with Spark and HyperLogLog
☆55Oct 14, 2015Updated 10 years ago
Alternatives and similar repositories for spark-hyperloglog
Users that are interested in spark-hyperloglog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- just some scripts that I use☆27Dec 19, 2012Updated 13 years ago
- Cantor provides utilities for estimating the cardinality of large sets.☆85Apr 12, 2022Updated 4 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆145Jan 26, 2016Updated 10 years ago
- Experiments with the GDELT dataset and Cassandra schemas.☆25Feb 9, 2016Updated 10 years ago
- Embedded Kafka for testing and quick prototyping.☆14Apr 19, 2016Updated 10 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- HDFS compatible Distributed Filesystem backed Cassandra☆25Sep 17, 2015Updated 10 years ago
- Examples for Fast Data Processing with Spark☆59Sep 10, 2013Updated 12 years ago
- Coursera Machine Learning class examples in Spark☆42Feb 14, 2014Updated 12 years ago
- An application to monitor and drive the Spark JobServer☆11Dec 12, 2014Updated 11 years ago
- Social Media Data Mining and Analytics - HyperLogLog, BloomFilter and CountMinSketch with Scalding & Algebird☆27Oct 6, 2018Updated 7 years ago
- Decorators/State & View Models for Ember.js applications☆11Sep 9, 2016Updated 9 years ago
- Sample of resteasy-netty project☆17Jun 25, 2015Updated 10 years ago
- Locality Sensitive Hashing for Apache Spark☆198Nov 1, 2016Updated 9 years ago
- Application that visualizes your google location history in form of a heatmap using Spark to aggregate the data.☆12Feb 19, 2015Updated 11 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Automates Spark standalone cluster tasks with Puppet and Fabric.☆43Aug 14, 2014Updated 11 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Feb 1, 2018Updated 8 years ago
- sparkhello: Scala to Spark - Hello World☆19Jul 12, 2017Updated 8 years ago
- Coding exercises for Apache Spark☆103Jun 4, 2015Updated 11 years ago
- A Pelican plugin to generate PDF resumes automatically from a Pelican page in Markdown☆11Feb 8, 2016Updated 10 years ago
- Ansible Role to install a Hadoop Cluster☆10Sep 21, 2020Updated 5 years ago
- ☆11Oct 8, 2015Updated 10 years ago
- Joins for skewed datasets in Spark☆58Aug 18, 2017Updated 8 years ago
- Example demonstrating a Scala project that builds using Gradle, produces a shadow jar suitable for spark-submit, and has tests using Scal…☆18Jun 18, 2015Updated 10 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Collection of Interesting Algorithms☆16Oct 13, 2020Updated 5 years ago
- Write data to files split by topic and rolled over on size or a timeout, files can be compressed using lzo, snappy or gzip☆11Jul 12, 2021Updated 4 years ago
- Sparse feature extraction with Spark☆30Jul 25, 2018Updated 7 years ago
- These are some code examples☆56Jan 12, 2020Updated 6 years ago
- Elastic Sentiment Analysis (using Apache Mesos, Marathon and Apache Spark)☆35Mar 16, 2015Updated 11 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Jun 22, 2014Updated 11 years ago
- Secondary sort and streaming reduce for Apache Spark☆77Jul 3, 2023Updated 2 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Dec 28, 2016Updated 9 years ago
- Dockerfile for Apache Zeppelin☆17Dec 9, 2015Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- An Apache Mesos Framework that allows for replaying load over and over and over (and over) again☆10Aug 10, 2015Updated 10 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Oct 27, 2016Updated 9 years ago
- Notes from 100 days with Kubernetes☆31Jan 25, 2019Updated 7 years ago
- Big Spatial Data Processing using Spark☆146Mar 7, 2017Updated 9 years ago
- On demand presto cluster with mesos, marathon and docker.☆29Mar 7, 2018Updated 8 years ago
- Abstract Algebra for Scala☆2,299Nov 21, 2025Updated 6 months ago
- Low level integration of Spark and Kafka☆129Mar 15, 2018Updated 8 years ago