The Internals of Spark on Kubernetes
☆73May 9, 2022Updated 4 years ago
Alternatives and similar repositories for spark-kubernetes-book
Users that are interested in spark-kubernetes-book are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Internals of PySpark☆28Dec 29, 2024Updated last year
- The Internals of Spark SQL☆487Jan 25, 2026Updated 4 months ago
- The Internals of Delta Lake☆186May 10, 2026Updated last month
- ☆18May 7, 2026Updated last month
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 7 months ago
- Scrapy exporter for Big Data formats☆16Mar 10, 2026Updated 3 months ago
- Best practices and recommendations for getting started with Amazon EMR on EKS.☆70May 4, 2026Updated last month
- Testing Sandbox for Hadoop Ecosystem Components☆45Updated this week
- Spark on Kubernetes infrastructure Docker images repo☆37Oct 20, 2022Updated 3 years ago
- Docker image for Spark history server on Kubernetes☆15Mar 13, 2020Updated 6 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 3 years ago
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.☆3,127Updated this week
- Rocksdb state storage implementation for Structured Streaming.☆17Oct 21, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆10Mar 12, 2021Updated 5 years ago
- The Internals of Spark Structured Streaming☆420Mar 3, 2026Updated 3 months ago
- Spark extensions for business contexts☆36Feb 19, 2020Updated 6 years ago
- Spark and Delta Lake Workshop☆22Jun 14, 2022Updated 4 years ago
- Docker image for sbt☆18Aug 6, 2022Updated 3 years ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆430Jan 14, 2022Updated 4 years ago
- The Internals of Apache Kafka☆132Aug 29, 2022Updated 3 years ago
- Host a graph database such as OrientDB on IBM Container Service using Kubernetes APIs☆12Apr 22, 2019Updated 7 years ago
- ☆25Mar 15, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆260Updated this week
- Spark on Kubernetes using Helm☆33Jun 9, 2020Updated 6 years ago
- Presto Trino with Apache Hive Postgres metastore☆43Sep 9, 2024Updated last year
- Infra stuff to run Kubernetes on travisci☆10Mar 7, 2023Updated 3 years ago
- Use Kubernetes to autoscale your spark clusters.☆10May 2, 2019Updated 7 years ago
- The Internals of Apache Spark☆1,546Apr 12, 2026Updated 2 months ago
- Spark Connector to read and write with Pulsar☆120May 26, 2026Updated 3 weeks ago
- SBT project showing shading a library with SBT assembly☆15Oct 4, 2018Updated 7 years ago
- trino monitoring with JMX metrics through Prometheus and Grafana☆17Aug 14, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Unofficial embeddable Stackoverflow profile summary card☆11Nov 19, 2022Updated 3 years ago
- Example to show how to deploy kafka dependent scala microservice with docker☆15Nov 28, 2017Updated 8 years ago
- ☆17Feb 16, 2020Updated 6 years ago
- An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC☆42Oct 1, 2024Updated last year
- A tool to validate data, built around Apache Spark.☆102Updated this week
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Aug 16, 2020Updated 5 years ago
- Open source stack lakehouse☆25Mar 2, 2024Updated 2 years ago