cloudera / kudu

☆881

Related projects:

cloudera / Impala
Real-time Query for Hadoop; mirror of Apache Impala
☆31 Updated last year
apache / kudu
Mirror of Apache Kudu
☆1,841 Updated this week
TIBCOSoftware / snappydata
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…
☆1,039 Updated last year
LinkedInAttic / camus
LinkedIn's previous generation Kafka to HDFS pipeline.
☆882 Updated 4 years ago
apache / hawq
Apache HAWQ
☆696 Updated 4 months ago
druid-io / tranquility
Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover…
☆516 Updated 4 years ago
apache / phoenix
Apache Phoenix
☆1,021 Updated this week
apache / samza
Mirror of Apache Samza
☆811 Updated 3 weeks ago
apache / oozie
Mirror of Apache Oozie
☆707 Updated 2 months ago
apache / tez
Apache Tez
☆471 Updated this week
cloudera / livy
Livy is an open source REST interface for interacting with Apache Spark from anywhere
☆1,008 Updated last year
linkedin / dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
☆1,353 Updated last year
apache / orc
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
☆678 Updated this week
yahoo / streaming-benchmarks
Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...
☆625 Updated 9 months ago
apache / impala
Apache Impala
☆1,123 Updated this week
Parquet / parquet-mr
☆236 Updated this week
uber-archive / AthenaX
SQL-based streaming analytics platform at scale
☆1,222 Updated 4 years ago
apache / apex-core
Mirror of Apache Apex core
☆350 Updated 3 years ago
hbutani / spark-druid-olap
Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…
☆285 Updated 6 years ago
apache / gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…
☆2,214 Updated this week
apache / carbondata
High performance data store solution
☆1,432 Updated 2 months ago
sameeragarwal / blinkdb
BlinkDB: Sub-Second Approximate Queries on Very Large Data.
☆660 Updated 10 years ago
apache / drill
Apache Drill is a distributed MPP query layer for self describing data
☆1,928 Updated 3 weeks ago
RedisLabs / spark-redis
A connector for Spark that allows reading and writing to/from Redis cluster
☆935 Updated 3 months ago
apache / eagle
Mirror of Apache Eagle
☆408 Updated 4 years ago
shunfei / indexr
An open-source columnar data format designed for fast & realtime analytic with big data.
☆453 Updated last year
kite-sdk / kite
Kite SDK
☆394 Updated last year
apache / sqoop
Mirror of Apache Sqoop
☆969 Updated 3 years ago
KylinOLAP / Kylin
This code base is retained for historical interest only, please visit Apache Incubator Repo for latest one
☆562 Updated last year
airbnb / reair
ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.
☆279 Updated 5 years ago