This is a library for SQL optimizing/rewriting including Materialized View rewrite
☆69Jun 21, 2022Updated 3 years ago
Alternatives and similar repositories for sql-booster
Users that are interested in sql-booster are comparing it to the libraries listed below
Sorting:
- A library based on delta for Spark and MLSQL☆60Dec 24, 2020Updated 5 years ago
- ☆13Jun 17, 2022Updated 3 years ago
- This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A…☆49Apr 21, 2023Updated 2 years ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆152Apr 21, 2023Updated 2 years ago
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.☆136Mar 6, 2023Updated 3 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…☆182Apr 6, 2022Updated 3 years ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Jan 3, 2023Updated 3 years ago
- ☆235Sep 15, 2022Updated 3 years ago
- Processing videos on Apache Spark☆12Feb 14, 2022Updated 4 years ago
- ☆22Jun 21, 2022Updated 3 years ago
- Moonbox is a DVtaaS (Data Virtualization as a Service) Platform☆506Apr 14, 2023Updated 2 years ago
- sql code autocomplete☆44Sep 2, 2020Updated 5 years ago
- 优化flink的多流操作(例如join),优化点不限于数据丢失问题,以及性能问题☆11Apr 8, 2019Updated 6 years ago
- ☆19Jun 16, 2021Updated 4 years ago
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆58Nov 11, 2021Updated 4 years ago
- ☆17Mar 19, 2024Updated 2 years ago
- Big data smart alarm by sql☆12May 11, 2021Updated 4 years ago
- A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources☆2,051Oct 25, 2022Updated 3 years ago
- My Blog☆76May 3, 2018Updated 7 years ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Jan 21, 2020Updated 6 years ago
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆257Feb 21, 2023Updated 3 years ago
- Testing Sandbox for Hadoop Ecosystem Components☆44Mar 12, 2026Updated last week
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆284Feb 24, 2026Updated 3 weeks ago
- Learn Data Lake From Storage Layer.☆44Aug 4, 2024Updated last year
- Wormhole is a SPaaS (Stream Processing as a Service) Platform☆976Nov 16, 2022Updated 3 years ago
- On the fly, translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.☆35Apr 15, 2025Updated 11 months ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆30Apr 16, 2018Updated 7 years ago
- Example Application to consume a twitter stream with Neo4j☆11Mar 16, 2017Updated 9 years ago
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 3 years ago
- Unified SQL Analytics Engine Based on SparkSQL☆211Dec 5, 2022Updated 3 years ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,530Updated this week
- A query predictor pipeline and service to predict resource usages of Presto queries☆15May 2, 2023Updated 2 years ago
- ☆33May 9, 2025Updated 10 months ago
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,843May 29, 2024Updated last year
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆257Apr 7, 2023Updated 2 years ago
- 算是简历吧....☆600Jan 6, 2023Updated 3 years ago
- ☆568Oct 30, 2023Updated 2 years ago
- a scala library for support jaskell design☆39Oct 23, 2023Updated 2 years ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆305Oct 30, 2025Updated 4 months ago