jaceklaskowski / spark-kubernetes-book
The Internals of Spark on Kubernetes
☆72 May 9, 2022 Updated 3 years ago
Alternatives and similar repositories for spark-kubernetes-book
Users who are interested in spark-kubernetes-book are comparing it to the libraries listed below.
- The Internals of PySpark ☆27 Dec 29, 2024 Updated last year
- The Internals of Delta Lake ☆187 Nov 30, 2025 Updated 2 months ago
- The Internals of Spark SQL ☆484 Jan 25, 2026 Updated 3 weeks ago
- Local Development of AWS Glue with Docker and Visual Studio Code ☆14 Nov 29, 2021 Updated 4 years ago
- Infra stuff to run Kubernetes on Travis CI ☆10 Mar 7, 2023 Updated 2 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the EMR Serverless job, you… ☆11 Nov 18, 2025 Updated 2 months ago
- Provides functionality to build statistical models to repair dirty tabular data in Spark ☆12 Apr 21, 2023 Updated 2 years ago
- ☆18 Nov 4, 2024 Updated last year
- Best practices and recommendations for getting started with Amazon EMR on EKS. ☆67 Jan 27, 2026 Updated 3 weeks ago
- Scrapy exporter for Big Data formats ☆16 Dec 2, 2025 Updated 2 months ago
- ☆17 Feb 16, 2020 Updated 6 years ago
- Trino connectors for accessing APIs with an OpenAPI spec ☆43 Feb 9, 2026 Updated last week
- Docker image for Spark history server on Kubernetes ☆15 Mar 13, 2020 Updated 5 years ago
- spark-sparql-connector ☆17 Jan 27, 2016 Updated 10 years ago
- Ranger Hive Metastore Plugin ☆18 Jul 21, 2023 Updated 2 years ago
- An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC ☆41 Oct 1, 2024 Updated last year
- RocksDB state storage implementation for Structured Streaming. ☆17 Oct 21, 2020 Updated 5 years ago
- SBT project showing how to shade a library with SBT assembly ☆15 Oct 4, 2018 Updated 7 years ago
- Apache Ranger Plugin for S3 ☆20 Nov 30, 2022 Updated 3 years ago
- The simplest production deployment solution for Spark SQL on Kubernetes ☆19 Jun 12, 2023 Updated 2 years ago
- Repository of the geospatial extension to DCAT-AP (GeoDCAT-AP) ☆22 Updated this week
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are in… ☆94 May 9, 2025 Updated 9 months ago
- A tool to validate data, built around Apache Spark. ☆100 Feb 9, 2026 Updated last week
- The Internals of Apache Spark ☆1,538 Jul 5, 2025 Updated 7 months ago
- ☆25 Mar 15, 2024 Updated last year
- Spark SQL listener to record lineage information ☆28 Jan 24, 2021 Updated 5 years ago
- A demo project that analyzes the most popular Twitter hashtags using Java 8, Spring Boot, Spark Streaming, Kafka, and Docker. ☆22 Nov 27, 2019 Updated 6 years ago
- Jupyter Notebook with GPU and Code Server! ☆22 Feb 25, 2024 Updated last year
- Spark and Delta Lake Workshop ☆22 Jun 14, 2022 Updated 3 years ago
- Spark on Kubernetes ☆104 Feb 20, 2023 Updated 2 years ago
- Spark Connector to read and write with Pulsar ☆117 Feb 6, 2026 Updated last week
- Operator for managing the Spark clusters on Kubernetes and OpenShift. ☆159 Nov 18, 2021 Updated 4 years ago
- Data Engineering with Scala, published by Packt ☆27 Feb 5, 2024 Updated 2 years ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads. ☆432 Jan 14, 2022 Updated 4 years ago
- The mm-ADT Virtual Machine ☆35 Nov 22, 2020 Updated 5 years ago
- Spark Tutorial at the University of Maryland ☆38 Oct 24, 2014 Updated 11 years ago
- Example project: run an Akka Typed application on Scala 3 ☆24 Apr 16, 2020 Updated 5 years ago
- The Internals of Spark Structured Streaming ☆422 Jan 25, 2026 Updated 3 weeks ago
- Helm charts for Trino and Trino Gateway ☆193 Jan 31, 2026 Updated 2 weeks ago