yaooqinn / tpcds-for-spark
☆23Updated 7 years ago
Alternatives and similar repositories for tpcds-for-spark
Users that are interested in tpcds-for-spark are comparing it to the libraries listed below
- Uniffle is a high performance, general purpose Remote Shuffle Service.☆426Updated last week
- Remote Shuffle Service for Flink☆191Updated 2 years ago
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆257Updated 2 years ago
- Testing Sandbox for Hadoop Ecosystem Components☆36Updated this week
- Benchmarks for Apache Flink☆178Updated 2 months ago
- Benchmarks for queries over continuous data streams.☆358Updated 8 months ago
- ☆389Updated last year
- Cloud Shuffle Service (CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.☆261Updated last year
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆976Updated last week
- Client libraries of end users of Apache Kyuubi☆11Updated 2 years ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆283Updated 2 weeks ago
- Compass is a task diagnosis platform for big data☆397Updated 9 months ago
- Gluten: Plugin to Boost Trino's Performance☆75Updated last year
- A Spark Atlas connector to track data lineage in Apache Atlas☆266Updated 2 years ago
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆257Updated 2 years ago
- A standalone module, split out for viewing the syntax tree generated by Spark SQL☆92Updated 6 years ago
- ☆21Updated 2 years ago
- ☆106Updated 2 years ago
- Shuttle: Highly Available, High Performance Remote Shuffle Service☆156Updated 2 years ago
- Web ui for Apache Paimon.☆142Updated 11 months ago
- An Extensible Data Skipping Framework☆47Updated last month
- A re-implementation of Hadoop DistCP in Apache Spark☆47Updated last year
- Flink Agents is an Agentic AI framework based on Apache Flink☆139Updated this week
- ☆17Updated last year
- Apache Kyuubi Site☆12Updated last month
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.☆134Updated 2 years ago
- Polycat is a cutting-edge cloud-native metastore system, purpose-built to cater to the demands of modern data management in lakehouse dep…☆18Updated last year
- Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution☆141Updated 2 years ago
- Spark ClickHouse Connector built on DataSourceV2 API☆204Updated last week
- ☆13Updated 3 years ago