A Python package to submit and manage Apache Spark applications on Kubernetes.
☆46Feb 27, 2026Updated 3 months ago
Alternatives and similar repositories for spark-on-k8s
Users that are interested in spark-on-k8s are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A package to run DuckDB queries from Apache Airflow.☆21Jun 17, 2024Updated last year
- An example of SparkConnect extension.☆15Mar 5, 2024Updated 2 years ago
- Helm chart for Lakekeeper - a Rust Native Iceberg REST Catalog☆24Updated this week
- ☆10May 5, 2022Updated 4 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆29Aug 18, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Visits sessionization pipeline used for the talk☆13May 28, 2024Updated 2 years ago
- A website made in hopes to recreate the no longer available Internet Wishlist. Open source project built completely by the community.☆14Dec 6, 2022Updated 3 years ago
- A demo instance of mage for pulling sample data from a public Google pub/sub topic and transforming with dbt.☆12Jan 5, 2024Updated 2 years ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆96May 11, 2026Updated last month
- Collection of Interesting Algorithms☆16Oct 13, 2020Updated 5 years ago
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools☆14Jun 18, 2022Updated 3 years ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario…☆28Mar 17, 2026Updated 2 months ago
- Building a real-time alert monitoring pipeline that sends email notifications off of Azure Event Hubs, Azure Databricks, and a Azure Logi…☆13Mar 8, 2020Updated 6 years ago
- ☆13May 11, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A library that brings useful functions from various modern database management systems to Apache Spark☆63Sep 4, 2023Updated 2 years ago
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,…☆29May 19, 2025Updated last year
- Postgresql configured to work as metastore for Hive.☆32Dec 16, 2022Updated 3 years ago
- ☆13Feb 19, 2025Updated last year
- spark-sight: Spark performance at a glance☆10Apr 6, 2023Updated 3 years ago
- ☆15Nov 16, 2023Updated 2 years ago
- A JDBC streaming source for Spark☆10Feb 19, 2024Updated 2 years ago
- Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs☆29Apr 4, 2026Updated 2 months ago
- A minimal seed template for an Apache Pekko in Scala☆13Apr 28, 2026Updated last month
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Flake8 plugin to lint for backwards incompatible database migrations☆12Updated this week
- ☆24Feb 7, 2024Updated 2 years ago
- A tiny library to make writing CBV-based APIs easier in Django.☆12Aug 23, 2024Updated last year
- functionality on top of an RDF store while accounting for and exploiting the fundamental differences between graph storage and relation…☆12Feb 21, 2024Updated 2 years ago
- PySpark test helper methods with beautiful error messages☆769May 20, 2026Updated 3 weeks ago
- The Paradise Papers dataset and guide from the International Consortium of Investigative Journalists (ICIJ)☆11Oct 25, 2024Updated last year
- ☆17Apr 2, 2024Updated 2 years ago
- X Tools for Claude MCP: A lightweight toolkit enabling Claude to search Twitter with natural language and display results based on user i…☆20Mar 25, 2025Updated last year
- This repository contains code for Spark Streaming☆26Mar 11, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆13May 14, 2023Updated 3 years ago
- ISP^2 is a plug-and-play prompting method☆12May 25, 2026Updated 2 weeks ago
- ☆14Sep 10, 2025Updated 9 months ago
- Transfer.sh command line program, Now file sharing from the command line is easy.☆13Feb 28, 2023Updated 3 years ago
- Plug and play Heroicons, Tabler and Lucide icons for Django Cotton.☆18Feb 23, 2026Updated 3 months ago
- Dockerfile to build a Prosody XMPP server container image.☆12Feb 22, 2022Updated 4 years ago
- Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN.☆129Sep 7, 2018Updated 7 years ago