lintool / my-data-is-bigger-than-your-data
My data is bigger than your data!
☆39 Updated 6 years ago
Alternatives and similar repositories for my-data-is-bigger-than-your-data
Users who are interested in my-data-is-bigger-than-your-data are comparing it to the libraries listed below.
- A Cascading Workflow Visualizer ☆83 Updated 2 years ago
- DEPRECATED A/B experiments service ☆34 Updated this week
- Cantor provides utilities for estimating the cardinality of large sets. ☆84 Updated 3 years ago
- Cascading on Apache Flink® ☆54 Updated last year
- @MissAmyTobey Writes ☆49 Updated 2 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆42 Updated 2 years ago
- Supporting material (code, schemas etc) for Unified Log Processing (Manning Publications) ☆98 Updated 3 years ago
- A template-based cluster provisioning system ☆61 Updated 2 years ago
- A tutorial that explains how to build a simple distributed fault-tolerant framework on top of Mesos ☆47 Updated 3 years ago
- A distributed queue built off cassandra ☆51 Updated 9 years ago
- ☆74 Updated 7 years ago
- Muppet ☆128 Updated 4 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra. ☆116 Updated 4 years ago
- ☆49 Updated 8 years ago
- Integration of Samza and Luwak ☆100 Updated 11 years ago
- Fabric-based framework for deploying and managing SolrCloud clusters in the cloud. ☆90 Updated 6 years ago
- Apache Yarn cluster docker image ☆35 Updated 8 years ago
- Compare eventual consistency of object stores ☆176 Updated last year
- All development now happens over here: https://github.com/cwensel/cascading. Cascading is a feature rich API for defining and executing c… ☆332 Updated 7 years ago
- NuCypher for Kafka. Start building from this module (it fetches the appropriate branch from Kafka repository) ☆18 Updated 8 years ago
- Explorations relative to cloning FlumeJava ☆94 Updated 5 years ago
- Embedded Kafka for testing and quick prototyping. ☆14 Updated 9 years ago
- Examples on how to use the command line tools in Avro Tools to read and write Avro files ☆153 Updated last year
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams. ☆63 Updated 9 years ago
- Pig on Apache Spark ☆82 Updated 10 years ago
- Query testing framework ☆71 Updated 5 months ago
- ☆76 Updated 9 years ago
- Storm on Mesos! ☆137 Updated 4 years ago
- Simple Samza Job Using Confluent Platform ☆14 Updated 9 years ago
- ☆68 Updated 9 years ago