A Python package to submit and manage Apache Spark applications on Kubernetes.
☆46Feb 27, 2026Updated 2 months ago
Alternatives and similar repositories for spark-on-k8s
Users that are interested in spark-on-k8s are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An example of SparkConnect extension.☆15Mar 5, 2024Updated 2 years ago
- ☆10Jun 29, 2021Updated 4 years ago
- Helm chart for Lakekeeper - a Rust Native Iceberg REST Catalog☆24Apr 15, 2026Updated 2 weeks ago
- ☆10May 5, 2022Updated 3 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆29Aug 18, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Example Flink and Kafka integration project☆15Nov 28, 2015Updated 10 years ago
- Tradução do livro "Snake Wrangling for Kids" - Domando Serpentes para Crianças☆11Nov 27, 2013Updated 12 years ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆95May 9, 2025Updated 11 months ago
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools☆14Jun 18, 2022Updated 3 years ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario…☆27Mar 17, 2026Updated last month
- Flink image for Kubernetes that fixes Jobmanage connection issue☆26Jul 31, 2018Updated 7 years ago
- Building a real-time alert monitoring pipeline that sends email notifications off of Azure Event Hubs, Azure Databricks, and a Azure Logi…☆13Mar 8, 2020Updated 6 years ago
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 3 months ago
- ☆13May 11, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A library that brings useful functions from various modern database management systems to Apache Spark☆62Sep 4, 2023Updated 2 years ago
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,…☆29May 19, 2025Updated 11 months ago
- Improving predictions of Bayesian neural nets via local linearization, AISTATS 2021☆15Dec 30, 2022Updated 3 years ago
- ☆13Feb 19, 2025Updated last year
- ☆15Nov 16, 2023Updated 2 years ago
- ☆16Jun 5, 2023Updated 2 years ago
- A JDBC streaming source for Spark☆10Feb 19, 2024Updated 2 years ago
- The ultimate Vim configuration: .vimrc (heavily customized, uncompromising and opinionated)☆11Mar 2, 2024Updated 2 years ago
- Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs☆29Apr 4, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- In this small project we will predict the email that in which folder it will go in spam or primary.☆11Jul 5, 2016Updated 9 years ago
- A minimal seed template for an Apache Pekko in Scala☆13Apr 7, 2026Updated 3 weeks ago
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Nov 16, 2022Updated 3 years ago
- Flake8 plugin to lint for backwards incompatible database migrations☆12Updated this week
- HiveQL Jupyter Kernel☆10Aug 5, 2022Updated 3 years ago
- PyTorch library for breast cancer metastasis detection in whole-slide images of sentinel lymph node tissue from the Camelyon dataset☆15Nov 25, 2019Updated 6 years ago
- Custom kube-scheduler for binpacking targeting Spark on EKS and other jobs workloads☆28Feb 24, 2026Updated 2 months ago
- Metrics for airflow☆14Oct 25, 2023Updated 2 years ago
- ☆23Feb 7, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A tiny library to make writing CBV-based APIs easier in Django.☆12Aug 23, 2024Updated last year
- dbt-databend adapter plugin☆10May 30, 2024Updated last year
- ☆17Feb 19, 2024Updated 2 years ago
- PySpark test helper methods with beautiful error messages☆761Apr 14, 2026Updated 2 weeks ago
- X Tools for Claude MCP: A lightweight toolkit enabling Claude to search Twitter with natural language and display results based on user i…☆19Mar 25, 2025Updated last year
- In this pattern, data records are ingested and then modified with simple transformations such as field level substitutions and data enric…☆14Nov 21, 2018Updated 7 years ago
- ☆32Jan 30, 2026Updated 3 months ago