metamx / druid
Real-time Exploratory Analytics on Large Datasets
☆122 Updated 5 years ago
Alternatives and similar repositories for druid:
Users who are interested in druid are comparing it to the libraries listed below.
- SQL interface to Druid. ☆77 Updated 9 years ago
- Apache Tephra: Transactions for HBase. ☆157 Updated 6 months ago
- Low latency, strong consistency, fault tolerant distributed key value store. Colocate data and compute to achieve best performance cloud … ☆114 Updated 9 years ago
- Fast and efficient batch computation engine for complex analysis and reporting of massive datasets on Hadoop ☆243 Updated 9 years ago
- MySQL-like queries for Druid built on top of Plywood ☆147 Updated 5 years ago
- Jetstream is a streaming processing framework ☆113 Updated 9 years ago
- Mirror of Apache Lens ☆60 Updated 5 years ago
- Hadoop MapReduce job to bulk load data into Cassandra ☆75 Updated 3 years ago
- A streaming / online query processing / analytics engine based on Apache Storm ☆271 Updated 7 years ago
- Hannibal is a tool to help monitor and maintain HBase clusters that are configured for manual splitting. ☆172 Updated 7 years ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data. ☆660 Updated 11 years ago
- Extensions, custom & experimental panels ☆52 Updated 9 years ago
- Next-generation web analytics processing with Scala, Spark, and Parquet. ☆331 Updated 10 years ago
- ☆76 Updated 8 years ago
- Mirror of Apache Blur ☆33 Updated 6 years ago
- Bitmap compression using the CONCISE algorithm ☆43 Updated 8 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner) ☆88 Updated 8 years ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time-based directory paths ☆42 Updated 8 years ago
- Druid indexing plugin for using Spark in batch jobs ☆101 Updated 3 years ago
- Mirror of Apache Apex core ☆349 Updated 3 years ago
- Hadoop log aggregator and dashboard ☆191 Updated 11 years ago
- All development now happens over here: https://github.com/cwensel/cascading. Cascading is a feature rich API for defining and executing c… ☆332 Updated 6 years ago
- A Bulk Data Pipeline out of Cassandra ☆323 Updated 5 years ago
- Multidimensional data storage with rollups for numerical data ☆266 Updated last year
- Mirror of Apache Spark ☆57 Updated 9 years ago
- Mirror of Apache Myriad (Incubating) ☆154 Updated last year
- Dumps state of Storm Kafka consumers ☆96 Updated 7 years ago
- ☆557 Updated 3 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit… ☆283 Updated 6 years ago
- Mirror of Apache Crunch (Incubating) ☆104 Updated 4 years ago