apache / iceberg-docs
Apache Iceberg Documentation Site
☆42 Updated last year
Alternatives and similar repositories for iceberg-docs:
Users who are interested in iceberg-docs are comparing it to the repositories listed below.
- Spline agent for Apache Spark ☆191 Updated this week
- Trino Connector for Apache Paimon. ☆31 Updated 2 months ago
- A Persistent Key-Value Store designed for Streaming processing ☆70 Updated this week
- A re-implementation of Hadoop DistCP in Apache Spark ☆47 Updated last year
- Benchmarks for Apache Flink ☆173 Updated this week
- Framework for running macro benchmarks in a clustered environment ☆32 Updated 2 weeks ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap… ☆299 Updated last year
- Uniffle is a high performance, general purpose Remote Shuffle Service. ☆401 Updated this week
- Storage connector for Trino ☆104 Updated last week
- Apache Flink Website ☆150 Updated last week
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer. ☆37 Updated 2 years ago
- Gluten: Plugin to Boost Trino's Performance ☆71 Updated last year
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments. ☆278 Updated this week
- ☆80 Updated this week
- Remote Shuffle Service for Flink ☆189 Updated 2 years ago
- All the things about TPC-DS in Apache Spark ☆104 Updated last year
- Spark ClickHouse Connector built on the DataSourceV2 API ☆193 Updated last month
- An Extensible Data Skipping Framework ☆43 Updated last month
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations. ☆257 Updated 2 years ago
- ☆63 Updated 6 months ago
- A load balancer / proxy / gateway for prestodb ☆357 Updated 7 months ago
- Benchmarks for queries over continuous data streams. ☆330 Updated 2 months ago
- Remote shuffle service for Apache Spark to store shuffle data on remote servers. ☆327 Updated last year
- The Internals of Delta Lake ☆183 Updated last month
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark. ☆133 Updated last year
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A… ☆119 Updated this week
- ☆184 Updated this week
- Apache Spark Kubernetes Operator ☆95 Updated this week
- Apache Flink shaded artifacts repository ☆136 Updated this week
- Apache Flink ☆31 Updated 3 weeks ago