Community Java bindings for https://github.com/facebookincubator/velox
☆41Mar 20, 2026Updated this week
Alternatives and similar repositories for velox4j
Users that are interested in velox4j are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Jan 3, 2023Updated 3 years ago
- Spark integrations for working with Lance datasets☆46Updated this week
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆20Feb 10, 2025Updated last year
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,532Updated this week
- Remote Shuffle Service for Flink☆191Jan 6, 2023Updated 3 years ago
- Apache DataFusion Benchmarks☆22Mar 3, 2026Updated 3 weeks ago
- Mirror of Apache Hive☆33Mar 16, 2020Updated 6 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆16Jan 4, 2026Updated 2 months ago
- The gateway component to make Spark on K8s much easier for Spark users.☆216Dec 16, 2025Updated 3 months ago
- Tasks API for Stateful Functions on Flink☆13Feb 28, 2026Updated 3 weeks ago
- A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.☆31Feb 5, 2026Updated last month
- A modular acceleration toolkit for big data analytic engines☆66May 6, 2024Updated last year
- ☆49Feb 14, 2022Updated 4 years ago
- A re-implementation of Hadoop DistCP in Apache Spark☆47Dec 20, 2023Updated 2 years ago
- Database smell detector☆13Jan 24, 2018Updated 8 years ago
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.☆136Mar 6, 2023Updated 3 years ago
- An Extensible Data Skipping Framework☆48Jul 15, 2025Updated 8 months ago
- 📤 In-memory implementation of SQS ideal for unit testing.☆14Jun 8, 2024Updated last year
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆971Updated this week
- Benchmarks for Apache Flink☆184Jan 4, 2026Updated 2 months ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,487Updated this week
- ☆248Updated this week
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…☆1,730Updated this week
- ☆100Updated this week
- Gluten: Plugin to Boost Trino's Performance☆76Oct 25, 2023Updated 2 years ago
- Apache Iceberg C++☆195Updated this week
- End-to-end SQL fuzz testing for DataFusion using SQLancer☆12Feb 9, 2026Updated last month
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆257Apr 7, 2023Updated 2 years ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆431Jan 14, 2022Updated 4 years ago
- ☆11Mar 14, 2024Updated 2 years ago
- ☆19Oct 15, 2024Updated last year
- ☆34Mar 5, 2026Updated 2 weeks ago
- A composable and fully extensible C++ execution engine library for data management systems.☆4,079Updated this week
- All the things about TPC-DS in Apache Spark☆109Jun 15, 2023Updated 2 years ago
- 项目中保留了向开源社区提交过的patch☆16Oct 22, 2017Updated 8 years ago
- Apache Spark - A unified analytics engine for large-scale data processing☆16Jul 24, 2023Updated 2 years ago
- Apache Fluss is a streaming storage built for real-time analytics.☆1,826Updated this week
- Oxia Java client SDK☆19Mar 16, 2026Updated last week
- Rust based high-performance Apache Uniffle shuffle-server☆62Updated this week