Port of TPC-DS dsdgen to Java
☆22Nov 29, 2022Updated 3 years ago
Alternatives and similar repositories for tpcds
Users that are interested in tpcds are comparing it to the libraries listed below
Sorting:
- Client libraries of end users of Apache Kyuubi☆11Jan 10, 2023Updated 3 years ago
- Framework for running macro benchmarks in a clustered environment☆37Mar 5, 2025Updated 11 months ago
- HDFS based on Java implementation as a remote ObjectStore for DataFusion☆10Feb 13, 2024Updated 2 years ago
- Framework for running macro benchmarks in a clustered environment☆25Aug 29, 2022Updated 3 years ago
- Base POM for Airlift☆53Updated this week
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆17Jan 4, 2026Updated 2 months ago
- A Rust SDK for StateFun (https://flink.apache.org/stateful-functions.html)☆32Jul 4, 2023Updated 2 years ago
- Apache DataFusion Benchmarks☆23Dec 31, 2025Updated 2 months ago
- SparkSQL自定义Hint优化器解决热点数据导致JOIN数据倾斜问题☆48Jan 4, 2019Updated 7 years ago
- Gluten: Plugin to Boost Trino's Performance☆76Oct 25, 2023Updated 2 years ago
- Frequently Asked Questions about PyFlink☆24Mar 1, 2023Updated 3 years ago
- ☆21Jun 13, 2023Updated 2 years ago
- Uniffle is a high performance, general purpose Remote Shuffle Service.☆445Updated this week
- All the things about TPC-DS in Apache Spark☆109Jun 15, 2023Updated 2 years ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆889Feb 9, 2026Updated 3 weeks ago
- Example application written using Reboot☆11Jan 24, 2026Updated last month
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆304Oct 30, 2025Updated 4 months ago
- Community Java bindings for https://github.com/facebookincubator/velox☆40Updated this week
- ☆393Jan 25, 2024Updated 2 years ago
- On the fly, translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.☆35Apr 15, 2025Updated 10 months ago
- Discovery Server☆55May 7, 2024Updated last year
- Testing Sandbox for Hadoop Ecosystem Components☆44Feb 25, 2026Updated last week
- ☆33May 9, 2025Updated 9 months ago
- ☆36Nov 11, 2022Updated 3 years ago
- ☆17Jan 23, 2026Updated last month
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Mar 9, 2021Updated 4 years ago
- 一个 AI 聚合应用,一次提问,自动提交多个 AI 助手回答☆24Dec 18, 2025Updated 2 months ago
- ☆10Mar 19, 2024Updated last year
- ☆34Mar 30, 2021Updated 4 years ago
- Json/Protobuf convertors for ScalaPB use circe☆47Updated this week
- Presto connector for Apache Paimon.☆12Jan 19, 2025Updated last year
- Kexplain is an interactive kubectl explain☆12Oct 23, 2023Updated 2 years ago
- native Rust implementation of Kafka protocol and api☆14Jun 13, 2023Updated 2 years ago
- Cloudera CDP SDK for Java☆16Updated this week
- Advanced futures library☆16Feb 3, 2026Updated last month
- The NVRC project provides a Rust binary that implements a simple init system for microVMs.☆25Feb 24, 2026Updated last week
- 日常笔记,面试题,spark源码剖析笔记☆13May 31, 2020Updated 5 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- 基于TBSchedule开发的一个分布式任务调度框架,可以解析任务间的依赖,并执行任务(执行Shell、bat脚本)☆12Aug 5, 2016Updated 9 years ago