lintool / my-data-is-bigger-than-your-dataLinks
My data is bigger than your data!
☆39Updated 6 years ago
Alternatives and similar repositories for my-data-is-bigger-than-your-data
Users that are interested in my-data-is-bigger-than-your-data are comparing it to the libraries listed below
Sorting:
- A Cascading Workflow Visualizer☆83Updated 2 years ago
- Cantor provides utilities for estimating the cardinality of large sets.☆84Updated 3 years ago
- A/B experiments service☆34Updated 5 months ago
- NuCypher for Kafka. Start building from this module (it fetches the appropriate branch from Kafka repository)☆18Updated 8 years ago
- Supporting material (code, schemas etc) for Unified Log Processing (Manning Publications)☆98Updated 3 years ago
- @MissAmyTobey Writes☆49Updated 2 years ago
- Apache Yarn cluster docker image☆35Updated 7 years ago
- A distributed queue built off cassandra☆51Updated 9 years ago
- Probabilistic data structures for Guava.☆54Updated 5 years ago
- Integration of Samza and Luwak☆100Updated 10 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Compare eventual consistency of object stores☆174Updated last year
- Cascading on Apache Flink®☆54Updated last year
- ☆74Updated 7 years ago
- ☆49Updated 8 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 9 years ago
- This project allows to run Samza jobs on Mesos cluster☆43Updated 4 years ago
- Serving system for batch generated data sets☆177Updated 8 years ago
- All development now happens over here: https://github.com/cwensel/cascading. Cascading is a feature rich API for defining and executing c…☆332Updated 6 years ago
- Fabric-based framework for deploying and managing SolrCloud clusters in the cloud.☆90Updated 6 years ago
- A tutorial that explains how to build a simple distributed fault-tolerant framework on top of Mesos☆47Updated 3 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆116Updated 3 years ago
- Keynote for QCon SF 2015!☆38Updated 9 years ago
- A gRPC service which proxies requests to an HTTP server.☆25Updated 7 years ago
- Improved Secondary Indexing with new Query Capabilities (OR, scoping) for Cassandra☆145Updated 9 years ago
- recordbus: mysql binlog to apache kafka☆80Updated 10 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 9 years ago
- Bloofi: A java implementation of multidimensional Bloom filters☆83Updated 4 months ago
- Storm on Mesos!☆137Updated 4 years ago
- s3mper - Consistent Listing for S3☆230Updated 2 years ago