Interactive Audience Analytics with Spark and HyperLogLog
☆55 · Oct 14, 2015 · Updated 10 years ago
Alternatives and similar repositories for spark-hyperloglog
Users that are interested in spark-hyperloglog are comparing it to the libraries listed below.
- just some scripts that I use ☆27 · Dec 19, 2012 · Updated 13 years ago
- Cantor provides utilities for estimating the cardinality of large sets. ☆84 · Apr 12, 2022 · Updated 4 years ago
- Spark Extension: ML transformers, SQL aggregations, etc. that are missing in Apache Spark ☆146 · Jan 26, 2016 · Updated 10 years ago
- Starter project for building MemSQL Streamliner Pipelines ☆32 · Apr 18, 2017 · Updated 9 years ago
- Experiments with the GDELT dataset and Cassandra schemas. ☆25 · Feb 9, 2016 · Updated 10 years ago
- Embedded Kafka for testing and quick prototyping. ☆14 · Apr 19, 2016 · Updated 10 years ago
- HDFS compatible Distributed Filesystem backed Cassandra ☆25 · Sep 17, 2015 · Updated 10 years ago
- Examples for Fast Data Processing with Spark ☆59 · Sep 10, 2013 · Updated 12 years ago
- Coursera Machine Learning class examples in Spark ☆42 · Feb 14, 2014 · Updated 12 years ago
- An application to monitor and drive the Spark JobServer ☆11 · Dec 12, 2014 · Updated 11 years ago
- Social Media Data Mining and Analytics - HyperLogLog, BloomFilter and CountMinSketch with Scalding & Algebird ☆27 · Oct 6, 2018 · Updated 7 years ago
- Decorators/State & View Models for Ember.js applications ☆11 · Sep 9, 2016 · Updated 9 years ago
- Sample of resteasy-netty project ☆17 · Jun 25, 2015 · Updated 10 years ago
- Locality Sensitive Hashing for Apache Spark ☆198 · Nov 1, 2016 · Updated 9 years ago
- A tool for running Spark on Google Compute Engine ☆16 · Jan 20, 2017 · Updated 9 years ago
- Automates Spark standalone cluster tasks with Puppet and Fabric. ☆43 · Aug 14, 2014 · Updated 11 years ago
- Scriptable scheduler for periodical Hadoop workflows ☆22 · Feb 1, 2018 · Updated 8 years ago
- sparkhello: Scala to Spark - Hello World ☆19 · Jul 12, 2017 · Updated 8 years ago
- Coding exercises for Apache Spark ☆104 · Jun 4, 2015 · Updated 10 years ago
- Ansible Role to install a Hadoop Cluster ☆10 · Sep 21, 2020 · Updated 5 years ago
- ☆11 · Oct 8, 2015 · Updated 10 years ago
- Joins for skewed datasets in Spark ☆57 · Aug 18, 2017 · Updated 8 years ago
- Example demonstrating a Scala project that builds using Gradle, produces a shadow jar suitable for spark-submit, and has tests using Scal… ☆18 · Jun 18, 2015 · Updated 10 years ago
- ☆12 · Apr 8, 2016 · Updated 10 years ago
- Sparse feature extraction with Spark ☆30 · Jul 25, 2018 · Updated 7 years ago
- These are some code examples ☆56 · Jan 12, 2020 · Updated 6 years ago
- Elastic Sentiment Analysis (using Apache Mesos, Marathon and Apache Spark) ☆35 · Mar 16, 2015 · Updated 11 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit ☆24 · Jun 22, 2014 · Updated 11 years ago
- Use Cascading Taps and Scalding DSL with Spark ☆49 · Dec 28, 2016 · Updated 9 years ago
- Scala and SQL happy together. ☆29 · Dec 13, 2016 · Updated 9 years ago
- Dockerfile for Apache Zeppelin ☆17 · Dec 9, 2015 · Updated 10 years ago
- An Apache Mesos Framework that allows for replaying load over and over and over (and over) again ☆10 · Aug 10, 2015 · Updated 10 years ago
- MLeap allows for easily putting Spark ML pipelines into production ☆78 · Oct 27, 2016 · Updated 9 years ago
- Notes from 100 days with Kubernetes ☆31 · Jan 25, 2019 · Updated 7 years ago
- Big Spatial Data Processing using Spark ☆146 · Mar 7, 2017 · Updated 9 years ago
- On demand presto cluster with mesos, marathon and docker. ☆29 · Mar 7, 2018 · Updated 8 years ago
- Abstract Algebra for Scala ☆2,297 · Nov 21, 2025 · Updated 6 months ago
- Charmander Scheduler Lab - Mesos, Docker, InfluxDB, Spark ☆67 · May 30, 2016 · Updated 9 years ago
- Low level integration of Spark and Kafka ☆131 · Mar 15, 2018 · Updated 8 years ago