A library for SQL optimization and rewriting, including materialized view rewriting
☆69Jun 21, 2022Updated 3 years ago
Alternatives and similar repositories for sql-booster
Users that are interested in sql-booster are comparing it to the libraries listed below
- A library based on delta for Spark and MLSQL☆60Dec 24, 2020Updated 5 years ago
- This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A…☆49Apr 21, 2023Updated 2 years ago
- ☆13Jun 17, 2022Updated 3 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa…☆183Apr 6, 2022Updated 3 years ago
- ☆22Jun 21, 2022Updated 3 years ago
- A library for querying Binlog with Apache Spark Structured Streaming, for Spark SQL, DataFrames and [MLSQL](https://www.mlsql.tech).☆152Apr 21, 2023Updated 2 years ago
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.☆136Mar 6, 2023Updated 2 years ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Jan 3, 2023Updated 3 years ago
- Processing videos on Apache Spark☆12Feb 14, 2022Updated 4 years ago
- SQL code autocomplete☆44Sep 2, 2020Updated 5 years ago
- ☆235Sep 15, 2022Updated 3 years ago
- ☆19Jun 16, 2021Updated 4 years ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Jan 21, 2020Updated 6 years ago
- Moonbox is a DVtaaS (Data Virtualization as a Service) Platform☆506Apr 14, 2023Updated 2 years ago
- ☆17Mar 19, 2024Updated last year
- My Blog☆76May 3, 2018Updated 7 years ago
- Unified SQL Analytics Engine Based on SparkSQL☆211Dec 5, 2022Updated 3 years ago
- Testing Sandbox for Hadoop Ecosystem Components☆44Updated this week
- a scala library for support jaskell design☆39Oct 23, 2023Updated 2 years ago
- ServiceFramework example project☆10Apr 2, 2016Updated 9 years ago
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 3 years ago
- A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources☆2,057Oct 25, 2022Updated 3 years ago
- Byzer VSCode Extension☆12Apr 19, 2023Updated 2 years ago
- Liga: Let Data Dance with ML Models☆13Sep 12, 2023Updated 2 years ago
- ☆13Mar 23, 2019Updated 6 years ago
- Demo for service oriented application hosted on Hadoop YARN cluster for HA and scheduling☆23Apr 2, 2018Updated 7 years ago
- Wormhole is a SPaaS (Stream Processing as a Service) Platform☆977Nov 16, 2022Updated 3 years ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,516Updated this week
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆257Feb 21, 2023Updated 3 years ago
- Big data smart alarm by SQL☆12May 11, 2021Updated 4 years ago
- A RPC framework leveraging Spark RPC module☆209Mar 13, 2019Updated 6 years ago
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆257Apr 7, 2023Updated 2 years ago
- ☆568Oct 30, 2023Updated 2 years ago
- An ad hoc query service based on the Spark SQL engine.☆381Dec 16, 2023Updated 2 years ago
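- DataTunnel is an ultra-high-performance distributed data integration tool built on the Spark engine that supports synchronizing massive volumes of data. Its DSL syntax, built on Spark extensions and combined with Spark SQL, fits more conveniently into the data warehouse ETLT process and is simple to use.☆34Feb 5, 2026Updated 3 weeks ago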
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆303Oct 30, 2025Updated 4 months ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆284Feb 18, 2026Updated last week
- Byzer (formerly MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,845May 29, 2024Updated last year
- On the fly, translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.☆35Apr 15, 2025Updated 10 months ago