lintool / my-data-is-bigger-than-your-dataLinks
My data is bigger than your data!
☆39Updated 6 years ago
Alternatives and similar repositories for my-data-is-bigger-than-your-data
Users that are interested in my-data-is-bigger-than-your-data are comparing it to the libraries listed below
Sorting:
- A java library for stored queries☆16Updated last year
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- A template-based cluster provisioning system☆61Updated 2 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆40Updated last year
- Cascading on Apache Flink®☆54Updated last year
- A framework to benchmark different graph databases, based on generated data from customizable schema, distribution, and size.☆25Updated 6 years ago
- Muppet☆127Updated 4 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- ☆49Updated 8 years ago
- Detect memory leaks in minutes without a heap dump.☆17Updated 8 years ago
- Apache Pig plugin for Eclipse☆12Updated 8 years ago
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆51Updated 8 years ago
- ☆43Updated 3 years ago
- A Cascading Workflow Visualizer☆83Updated 2 years ago
- Pivotal GemFire XD☆13Updated 4 years ago
- Cantor provides utilities for estimating the cardinality of large sets.☆83Updated 3 years ago
- Time series analysis with Apache Spark based on Chronix |☆38Updated 8 years ago
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities☆26Updated 5 months ago
- phData Pulse application log aggregation and monitoring☆13Updated 5 years ago
- Python Implementation of Super and Hyper Log Log Sketches☆49Updated 13 years ago
- Spash☆24Updated 9 years ago
- Github mirror of "analytics/kafkatee" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access…☆21Updated last year
- Feature rich service discovery on ZooKeeper☆30Updated 2 years ago
- Examples of user defined functions for Apache Drill☆18Updated 8 years ago
- Query testing framework☆70Updated 2 weeks ago
- A scalable, distributed Time Series Database.☆28Updated 10 years ago
- Use SQL to transform your avro schema/records☆28Updated 7 years ago
- dynamically parse protobuf message then convert to avro☆25Updated 10 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆116Updated 3 years ago