A Python package to submit and manage Apache Spark applications on Kubernetes.
☆46Feb 27, 2026Updated 2 months ago
Alternatives and similar repositories for spark-on-k8s
Users that are interested in spark-on-k8s are comparing it to the libraries listed below.
- A package to run DuckDB queries from Apache Airflow.☆21Jun 17, 2024Updated last year
- An example of SparkConnect extension.☆15Mar 5, 2024Updated 2 years ago
- ☆10Jun 29, 2021Updated 4 years ago
- ☆10May 5, 2022Updated 4 years ago
- Example Flink and Kafka integration project☆15Nov 28, 2015Updated 10 years ago
- Terraform module for Cloudera Manager☆11May 6, 2020Updated 6 years ago
- Visits sessionization pipeline used for the talk☆13May 28, 2024Updated last year
- A website made in hopes to recreate the no longer available Internet Wishlist. Open source project built completely by the community.☆14Dec 6, 2022Updated 3 years ago
- ☆23Feb 5, 2024Updated 2 years ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are in…☆96May 11, 2026Updated last week
- Dockerfile for OpenLogReplicator☆21Mar 3, 2026Updated 2 months ago
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools☆14Jun 18, 2022Updated 3 years ago
- Building a real-time alert monitoring pipeline that sends email notifications off of Azure Event Hubs, Azure Databricks, and an Azure Logi…☆13Mar 8, 2020Updated 6 years ago
- Authentication classes to be used with requests☆39Jun 18, 2024Updated last year
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 4 months ago
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,…☆29May 19, 2025Updated last year
- ☆13Feb 19, 2025Updated last year
- spark-sight: Spark performance at a glance☆10Apr 6, 2023Updated 3 years ago
- ☆16Jun 5, 2023Updated 2 years ago
- A JDBC streaming source for Spark☆10Feb 19, 2024Updated 2 years ago
- Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs☆29Apr 4, 2026Updated last month
- A minimal seed template for an Apache Pekko in Scala☆13Apr 28, 2026Updated 3 weeks ago
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Nov 16, 2022Updated 3 years ago
- Produces a suitable .gitlab-ci.yml file from a Golang TXT Template to work as input for a parent/child triggered GitLab CICD pipeline.☆11May 6, 2026Updated 2 weeks ago
- Flake8 plugin to lint for backwards incompatible database migrations☆12May 15, 2026Updated last week
- HiveQL Jupyter Kernel☆10Aug 5, 2022Updated 3 years ago
- PyTorch library for breast cancer metastasis detection in whole-slide images of sentinel lymph node tissue from the Camelyon dataset☆15Nov 25, 2019Updated 6 years ago
- ☆23Feb 7, 2024Updated 2 years ago
- A tiny library to make writing CBV-based APIs easier in Django.☆12Aug 23, 2024Updated last year
- functionality on top of an RDF store while accounting for and exploiting the fundamental differences between graph storage and relation…☆12Feb 21, 2024Updated 2 years ago
- The starter for Martin Tile Server☆15Jan 12, 2026Updated 4 months ago
- ☆18Jun 9, 2025Updated 11 months ago
- dbt-databend adapter plugin☆10May 30, 2024Updated last year
- Custom kube-scheduler for binpacking targeting Spark on EKS and other jobs workloads☆29Feb 24, 2026Updated 2 months ago
- ☆17Feb 19, 2024Updated 2 years ago
- Scalable Batch and Stream Data Processing☆30Aug 21, 2024Updated last year
- PySpark test helper methods with beautiful error messages☆765Updated this week
- Update cookiecutter projects☆12Mar 30, 2021Updated 5 years ago
- ☆17Apr 2, 2024Updated 2 years ago