Apache Spark Website
☆134 Mar 12, 2026 Updated last week
Alternatives and similar repositories for spark-website
Users who are interested in spark-website compare it to the repositories listed below
- Spark Structured Streaming State Tools ☆34 Jul 3, 2020 Updated 5 years ago
- Minimal starter for using React + PostCSS with Webpack. ☆17 Feb 5, 2019 Updated 7 years ago
- Mirror of Apache HCatalog ☆59 Apr 14, 2023 Updated 2 years ago
- ☆103 Mar 23, 2020 Updated 5 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆16 Jan 4, 2026 Updated 2 months ago
- Running TPC-H on Apache Hive ☆41 Jul 15, 2019 Updated 6 years ago
- Rocksdb state storage implementation for Structured Streaming. ☆17 Oct 21, 2020 Updated 5 years ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. ☆946 Mar 2, 2026 Updated 2 weeks ago
- A java agent for tracing which can be configured via simple text file and instruments the code without rebuilding the project. ☆50 Feb 28, 2026 Updated 3 weeks ago
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads ☆16 Feb 18, 2026 Updated last month
- Apache Spark Kubernetes Operator ☆267 Updated this week
- Unity Catalog UI ☆43 Sep 6, 2024 Updated last year
- JVM integration for Weld ☆16 Sep 24, 2018 Updated 7 years ago
- As this has moved to Databricks, please go to: https://github.com/databricks/spark-xml ☆15 Dec 16, 2015 Updated 10 years ago
- Albis: High-Performance File Format for Big Data Systems ☆21 Jul 12, 2018 Updated 7 years ago
- Spark Structured Streaming Kafka 0.8 Source Implementation ☆35 Apr 27, 2017 Updated 8 years ago
- Schema Registry integration for Apache Spark ☆40 Nov 16, 2022 Updated 3 years ago
- Oryx 2 (incubating): Lambda architecture on Spark for real-time large scale machine learning ☆14 Jul 14, 2021 Updated 4 years ago
- Mirror of Apache Hive ☆33 Mar 16, 2020 Updated 6 years ago
- Apache (Py)Spark type annotations (stub files). ☆118 Aug 17, 2022 Updated 3 years ago
- The Almaren Framework provides a simplified consistent minimalistic layer over Apache Spark. While still allowing you to take advantage o… ☆31 Jun 18, 2025 Updated 9 months ago
- Speak Slack notifications and process Slack slash commands ☆15 Dec 20, 2018 Updated 7 years ago
- Mirror of Apache Toree (Incubating) ☆749 Mar 9, 2026 Updated last week
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs ☆967 Updated this week
- Apache Spark Connect Client for Swift ☆30 Updated this week
- All the things about TPC-DS in Apache Spark ☆109 Jun 15, 2023 Updated 2 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr… ☆8,635 Mar 13, 2026 Updated last week
- ☆10 Nov 9, 2017 Updated 8 years ago
- GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs ☆1,144 Updated this week
- Spark, Spark Streaming and Spark SQL unit testing strategies ☆215 Oct 12, 2016 Updated 9 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas ☆266 Nov 16, 2022 Updated 3 years ago
- Spark-Transformers: Library for exporting Apache Spark MLLIB models to use them in any Java application with no other dependencies. ☆42 Dec 15, 2017 Updated 8 years ago
- Spark integrations for working with Lance datasets ☆46 Updated this week
- Remote shuffle service for Apache Spark to store shuffle data on remote servers. ☆335 Sep 29, 2023 Updated 2 years ago
- Pandas helper functions ☆31 Feb 19, 2023 Updated 3 years ago
- Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce. ☆262 May 12, 2024 Updated last year
- The Internals of Apache Spark ☆1,542 Jul 5, 2025 Updated 8 months ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa… ☆182 Apr 6, 2022 Updated 3 years ago
- A sample implementation of the Spark Datasource API ☆24 Apr 15, 2017 Updated 8 years ago