lintool / my-data-is-bigger-than-your-dataLinks
My data is bigger than your data!
☆39Updated 6 years ago
Alternatives and similar repositories for my-data-is-bigger-than-your-data
Users that are interested in my-data-is-bigger-than-your-data are comparing it to the libraries listed below
Sorting:
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- ☆49Updated 7 years ago
- Last-seen sketch implementation in Go☆16Updated 4 years ago
- A java library for stored queries☆16Updated last year
- A template-based cluster provisioning system☆61Updated 2 years ago
- Github mirror of "analytics/kafkatee" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access…☆21Updated last year
- Embedded Kafka for testing and quick prototyping.☆14Updated 9 years ago
- A framework to benchmark different graph databases, based on generated data from customizable schema, distribution, and size.☆25Updated 6 years ago
- Automatically exported from code.google.com/p/segment-trees☆11Updated 9 years ago
- Java and Scala client libraries for Concord☆13Updated 8 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 5 years ago
- Spash☆24Updated 9 years ago
- Cantor provides utilities for estimating the cardinality of large sets.☆83Updated 3 years ago
- YCB Java☆27Updated 2 years ago
- A Cascading Workflow Visualizer☆83Updated 2 years ago
- Cascading on Apache Flink®☆54Updated last year
- Probabilistic data structures server. The data model is key-value, where values are: Bloomfilters, LinearCounters, HyperLogLogs, CountMin…☆25Updated 9 years ago
- NuCypher for Kafka. Start building from this module (it fetches the appropriate branch from Kafka repository)☆18Updated 7 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- [DEPRECATED] For read-only reference of the ALOJA Big Data Benchmarking platform: includes tools to define and deploy clusters, orchestr…☆23Updated 4 years ago
- ☆43Updated 3 years ago
- ☆17Updated 10 years ago
- Muppet☆126Updated 4 years ago
- HDFS compatible Distributed Filesystem backed Cassandra☆25Updated 9 years ago
- Port of Twitter's Scala JVM-profiler to Java☆15Updated 2 years ago
- Atomix Jepsen tests☆14Updated 8 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆139Updated 8 years ago
- A/B experiments service☆33Updated last month
- Library for per-file client-side encyption in Hadoop FileSystems such as HDFS or S3.☆47Updated this week
- Builds a single node Druid cluster using Vagrant☆9Updated 9 years ago