apache/carbondata

apache / carbondata

High performance data store solution

☆1,448

Alternatives and similar repositories for carbondata

Users that are interested in carbondata are comparing it to the libraries listed below.

QiangCai / carbondata_guide
Apache CarbonData source code reading
☆61 · Feb 12, 2020 · Updated 6 years ago
apache / kylin
Apache Kylin
☆3,769 · Updated this week
apache / kudu
Mirror of Apache Kudu
☆1,904 · Updated this week
apache / kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,353 · Updated this week
apache / calcite
Apache Calcite
☆5,159 · Updated this week
xubo245 / CarbonDataLearning
Apache CarbonData Learning
☆53 · Mar 5, 2020 · Updated 6 years ago
byzer-org / byzer-lang
Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.
☆1,835 · May 29, 2024 · Updated 2 years ago
shunfei / indexr
An open-source columnar data format designed for fast & realtime analytic with big data.
☆447 · Nov 16, 2022 · Updated 3 years ago
Alluxio / alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
☆7,214 · Apr 29, 2025 · Updated last year
apache / drill
Apache Drill is a distributed MPP query layer for self describing data
☆2,022 · Jul 15, 2026 · Updated last week
apache / hudi
Upserts, Deletes And Incremental Processing on Big Data.
☆6,193 · Updated this week
apache / hawq
Apache HAWQ
☆696 · May 16, 2024 · Updated 2 years ago
lw-lin / CoolplaySpark
Coolplay Spark: Spark source code analysis, Spark libraries, and more
☆3,475 · May 18, 2022 · Updated 4 years ago
TIBCOSoftware / snappydata
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…
☆1,032 · Nov 21, 2022 · Updated 3 years ago
delta-io / delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…
☆8,924 · Updated this week
apache / livy
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
☆958 · Jul 9, 2026 · Updated last week
apache / phoenix
Apache Phoenix
☆1,060 · Updated this week
apache / eagle
Mirror of Apache Eagle
☆411 · Aug 22, 2020 · Updated 5 years ago
apache / hbase
Apache HBase
☆5,549 · Updated this week
apache / ambari
Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters.
☆2,306 · Updated this week
apache / pinot
Apache Pinot - A realtime distributed OLAP datastore
☆6,117 · Updated this week
apache / gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…
☆2,270 · Jun 24, 2026 · Updated 3 weeks ago
apache / zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
☆6,645 · Updated this week
prestodb / presto
The official home of the Presto distributed SQL query engine for big data
☆16,719 · Updated this week
openlookeng / hetu-core
☆571 · Oct 30, 2023 · Updated 2 years ago
apache / druid
Apache Druid: a high performance real-time analytics database.
☆14,034 · Updated this week
apache / ignite
Apache Ignite
☆5,073 · Updated this week
spark-jobserver / spark-jobserver
REST job server for Apache Spark
☆2,837 · Mar 3, 2026 · Updated 4 months ago
linkedin / dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
☆1,370 · Aug 22, 2023 · Updated 2 years ago
apache / iceberg
Apache Iceberg
☆9,067 · Updated this week
Huawei-Spark / Spark-SQL-on-HBase
Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces
☆316 · Apr 12, 2022 · Updated 4 years ago
apache / flink
Apache Flink
☆26,200 · Updated this week
apache / griffin
Mirror of Apache griffin
☆1,169 · Aug 3, 2025 · Updated 11 months ago
apache / gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
☆1,576 · Updated this week
linkedin / transport
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…
☆306 · Jun 29, 2026 · Updated 3 weeks ago
linkedin / coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
☆907 · Updated this week
cloudera / Impala
Real-time Query for Hadoop; mirror of Apache Impala
☆34 · Dec 27, 2022 · Updated 3 years ago
apache / impala
Apache Impala
☆1,279 · Updated this week
apache / beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
☆8,635 · Updated this week