The Internals of Spark on Kubernetes
☆73May 9, 2022Updated 3 years ago
Alternatives and similar repositories for spark-kubernetes-book
Users that are interested in spark-kubernetes-book are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Internals of PySpark☆28Dec 29, 2024Updated last year
- The Internals of Spark SQL☆488Jan 25, 2026Updated 2 months ago
- The Internals of Delta Lake☆188Nov 30, 2025Updated 4 months ago
- ☆18Nov 4, 2024Updated last year
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 5 months ago
- Scrapy exporter for Big Data formats☆16Mar 10, 2026Updated last month
- Best practices and recommendations for getting started with Amazon EMR on EKS.☆68Jan 27, 2026Updated 2 months ago
- Testing Sandbox for Hadoop Ecosystem Components☆44Updated this week
- Spark on Kubernetes infrastructure Docker images repo☆37Oct 20, 2022Updated 3 years ago
- Docker image for Spark history server on Kubernetes☆15Mar 13, 2020Updated 6 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 2 years ago
- Rocksdb state storage implementation for Structured Streaming.☆17Oct 21, 2020Updated 5 years ago
- ☆10Mar 12, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The Internals of Spark Structured Streaming☆423Mar 3, 2026Updated last month
- This project is part of my talk about Project Panama. The goal is to show how you can call (almost) any C library using Java☆14Mar 27, 2025Updated last year
- Spark extensions for business contexts☆36Feb 19, 2020Updated 6 years ago
- Examples for High Performance Spark☆16Oct 25, 2025Updated 5 months ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆431Jan 14, 2022Updated 4 years ago
- The Internals of Apache Kafka☆133Aug 29, 2022Updated 3 years ago
- ☆25Mar 15, 2024Updated 2 years ago
- ☆252Updated this week
- Spark on Kubernetes using Helm☆33Jun 9, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆286Feb 24, 2026Updated last month
- Presto Trino with Apache Hive Postgres metastore☆43Sep 9, 2024Updated last year
- Infra stuff to run Kubernetes on travisci☆10Mar 7, 2023Updated 3 years ago
- Trino connectors for accessing APIs with an OpenAPI spec☆43Feb 9, 2026Updated 2 months ago
- The Internals of Apache Spark☆1,548Updated this week
- Spark Connector to read and write with Pulsar☆119Feb 23, 2026Updated last month
- A K8s-based infrastructure for analytics☆24Jan 15, 2020Updated 6 years ago
- SBT project showing shading a library with SBT assembly☆15Oct 4, 2018Updated 7 years ago
- trino monitoring with JMX metrics through Prometheus and Grafana☆17Aug 14, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Unofficial embeddable Stackoverflow profile summary card☆11Nov 19, 2022Updated 3 years ago
- An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC☆41Oct 1, 2024Updated last year
- A tool to validate data, built around Apache Spark.☆101Feb 19, 2026Updated last month
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Aug 16, 2020Updated 5 years ago
- Open source stack lakehouse☆25Mar 2, 2024Updated 2 years ago
- Helm charts for Trino and Trino Gateway☆194Mar 30, 2026Updated 2 weeks ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Jun 15, 2023Updated 2 years ago