apache / incubator-uniffle

Uniffle is a high performance, general purpose Remote Shuffle Service.

☆400

Alternatives and similar repositories for incubator-uniffle:

Users that are interested in incubator-uniffle are comparing it to the libraries listed below

Tencent / Firestorm
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…
☆255Updated last year
flink-extended / flink-remote-shuffle
Remote Shuffle Service for Flink
☆189Updated 2 years ago
apache / celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
☆926Updated this week
nexmark / nexmark
Benchmarks for queries over continuous data streams.
☆330Updated 2 months ago
bytedance / CloudShuffleService
Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.
☆255Updated 9 months ago
apache / flink-benchmarks
Benchmarks for Apache Flink
☆173Updated 3 weeks ago
uber / RemoteShuffleService
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
☆327Updated last year
ververica / ForSt
A Persistent Key-Value Store designed for Streaming processing
☆69Updated 3 weeks ago
oap-project / gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
☆257Updated 2 years ago
linkedin / transport
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…
☆299Updated last year
apache / paimon-webui
Web ui for Apache Paimon.
☆137Updated 4 months ago
oap-project / Gluten-Trino
Gluten: Plugin to Boost Trino's Performance
☆70Updated last year
apache / incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
☆1,265Updated this week
ExpediaGroup / waggle-dance
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
☆277Updated last week
Intel-bigdata / SSM
Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution
☆140Updated 2 years ago
alibaba / SparkCube
SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.
☆133Updated last year
linkedin / coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
☆822Updated last week
apache / paimon-trino
Trino Connector for Apache Paimon.
☆31Updated 2 months ago
hortonworks / hive-testbench
☆381Updated last year
apache / amoro
Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
☆921Updated this week
apache / doris-spark-connector
Spark Connector for Apache Doris
☆87Updated this week
cubefs / compass
Compass is a task diagnosis platform for bigdata
☆370Updated 2 months ago
ververica / flink-sql-benchmark
☆106Updated last year
maropu / spark-tpcds-datagen
All the things about TPC-DS in Apache Spark
☆104Updated last year
kangkaisen / olap-performance
OLAP Database Performance Tuning Guide
☆367Updated last year
StarRocks / starrocks-connector-for-apache-flink
☆196Updated last month
huangfox / dpkb
大数据相关内容汇总，包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词：Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse
☆231Updated 2 months ago
apache / flink-connector-kafka
Apache flink
☆142Updated this week
apache / calcite-avatica
Apache Calcite Avatica
☆257Updated this week
leesf / hudi-resources
汇总Apache Hudi相关资料
☆544Updated last month