A Python package to submit and manage Apache Spark applications on Kubernetes.
☆46 · Feb 27, 2026 · Updated last month
Alternatives and similar repositories for spark-on-k8s
Users that are interested in spark-on-k8s are comparing it to the libraries listed below.
- A package to run DuckDB queries from Apache Airflow. ☆21 · Jun 17, 2024 · Updated last year
- An example of SparkConnect extension. ☆15 · Mar 5, 2024 · Updated 2 years ago
- MCP Server for Apache Airflow ☆31 · Oct 14, 2025 · Updated 5 months ago
- Helm chart for Lakekeeper - a Rust Native Iceberg REST Catalog ☆23 · Mar 20, 2026 · Updated last week
- Sample code to collect Apache Iceberg metrics for table monitoring ☆29 · Aug 18, 2024 · Updated last year
- Visits sessionization pipeline used for the talk ☆13 · May 28, 2024 · Updated last year
- ☆22 · Feb 5, 2024 · Updated 2 years ago
- A demo instance of mage for pulling sample data from a public Google pub/sub topic and transforming with dbt. ☆12 · Jan 5, 2024 · Updated 2 years ago
- React Native module for lightweight universal authentication using Keycloak ☆23 · Feb 8, 2023 · Updated 3 years ago
- Dockerfile for OpenLogReplicator ☆21 · Mar 3, 2026 · Updated 3 weeks ago
- ☆26 · Sep 15, 2025 · Updated 6 months ago
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools ☆14 · Jun 18, 2022 · Updated 3 years ago
- Building a real-time alert monitoring pipeline that sends email notifications off of Azure Event Hubs, Azure Databricks, and an Azure Logi… ☆13 · Mar 8, 2020 · Updated 6 years ago
- ansible role redis (cluster and standalone mode) ☆13 · Nov 2, 2019 · Updated 6 years ago
- ☆13 · May 11, 2025 · Updated 10 months ago
- A library that brings useful functions from various modern database management systems to Apache Spark ☆62 · Sep 4, 2023 · Updated 2 years ago
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,… ☆28 · May 19, 2025 · Updated 10 months ago
- ☆13 · Feb 19, 2025 · Updated last year
- ☆15 · Nov 16, 2023 · Updated 2 years ago
- ☆16 · Jun 13, 2023 · Updated 2 years ago
- A JDBC streaming source for Spark ☆10 · Feb 19, 2024 · Updated 2 years ago
- A minimal seed template for an Apache Pekko in Scala ☆12 · Mar 16, 2026 · Updated 2 weeks ago
- AI model Prompt Tester (AIPT for short) is a simple app that will check how suitable each model is for a given prompt. ☆15 · Jul 7, 2024 · Updated last year
- Produces a suitable .gitlab-ci.yml file from a Golang TXT Template to work as input for a parent/child triggered GitLab CICD pipeline. ☆11 · Updated this week
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook. ☆17 · Nov 16, 2022 · Updated 3 years ago
- Flake8 plugin to lint for backwards incompatible database migrations ☆12 · Mar 23, 2026 · Updated last week
- PyTorch library for breast cancer metastasis detection in whole-slide images of sentinel lymph node tissue from the Camelyon dataset ☆15 · Nov 25, 2019 · Updated 6 years ago
- Custom kube-scheduler for binpacking targeting Spark on EKS and other jobs workloads ☆26 · Feb 24, 2026 · Updated last month
- Get Twitter trends with twitter4j, stream it to a Kafka topic, save it to MongoDB and visualize in Google Maps ☆13 · Sep 30, 2021 · Updated 4 years ago
- Chotot Web Standards ☆10 · Feb 2, 2026 · Updated last month
- ☆18 · Mar 24, 2020 · Updated 6 years ago
- The starter for Martin Tile Server ☆15 · Jan 12, 2026 · Updated 2 months ago
- A tiny library to make writing CBV-based APIs easier in Django. ☆12 · Aug 23, 2024 · Updated last year
- dbt-databend adapter plugin ☆10 · May 30, 2024 · Updated last year
- Deploy your own private OpenAI-compatible LLM ☆27 · Jun 5, 2025 · Updated 9 months ago
- ☆17 · Feb 19, 2024 · Updated 2 years ago
- PySpark test helper methods with beautiful error messages ☆756 · Updated this week
- The Paradise Papers dataset and guide from the International Consortium of Investigative Journalists (ICIJ) ☆11 · Oct 25, 2024 · Updated last year
- Repository for the dbt Semantic Layer course ☆13 · Mar 23, 2026 · Updated last week