Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.
☆37Jan 3, 2023Updated 3 years ago
Alternatives and similar repositories for sql-ds-cache
Users that are interested in sql-ds-cache are comparing it to the libraries listed below
Sorting:
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆257Feb 21, 2023Updated 3 years ago
- Tools for building, packaging, and OAP public cloud integrations such as AWS EMR, Google Dataproc and K8S.☆18Mar 27, 2024Updated last year
- Community Java bindings for https://github.com/facebookincubator/velox☆39Updated this week
- Spark* Shuffle plugin for support shuffling through remote persistent memory over fabrics, which leverages the RDMA network and remote pe…☆14Sep 18, 2023Updated 2 years ago
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.☆136Mar 6, 2023Updated 2 years ago
- DynoYARN is a framework to run simulated YARN clusters and workloads for YARN scale testing.☆60Mar 6, 2023Updated 2 years ago
- The Lightning Catalog is an open-source data catalog designed for preparing data at any scale in ad-hoc analytics, data virtualization, …☆37Feb 5, 2026Updated 3 weeks ago
- ☆39Mar 4, 2019Updated 6 years ago
- Java event logs collector for hadoop and frameworks☆41Mar 25, 2025Updated 11 months ago
- Client libraries of end users of Apache Kyuubi☆11Jan 10, 2023Updated 3 years ago
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 3 years ago
- A re-implementation of Hadoop DistCP in Apache Spark☆47Dec 20, 2023Updated 2 years ago
- Prescriptive Applications over Kite and Hadoop☆12Oct 14, 2015Updated 10 years ago
- DataFuse operator manages fuse-query and fuse-store clusters atop Kubernetes using CRDs.