sbakiu / kubeflow-sparkView external linksLinks
Orchestrate Spark Jobs from Kubeflow Pipelines and poll for the status.
☆53May 26, 2022Updated 3 years ago
Alternatives and similar repositories for kubeflow-spark
Users that are interested in kubeflow-spark are comparing it to the libraries listed below
Sorting:
- Magic to help Spark pipelines upgrade☆34Sep 29, 2024Updated last year
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 2 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- Deploying a Machine Learning model streaming application with Apache Kafka☆11Aug 21, 2022Updated 3 years ago
- Resources backing the Feast fraud tutorial on GCP☆14May 31, 2022Updated 3 years ago
- ☆31Jul 8, 2022Updated 3 years ago
- Processing videos on Apache Spark☆12Feb 14, 2022Updated 4 years ago
- Operator for Apache Spark-on-Kubernetes for Stackable Data Platform☆69Feb 6, 2026Updated last week
- Image building contents for running Spark standalone on Kubernetes☆16Apr 10, 2020Updated 5 years ago
- ☆19Dec 19, 2023Updated 2 years ago
- Mock streaming data generator☆17May 31, 2024Updated last year
- End to end mlflow with feast example☆17May 18, 2021Updated 4 years ago
- Writing PySpark logs in Apache Spark and Databricks☆17Jun 13, 2022Updated 3 years ago
- Dense or Sparse : Optimal SPMM-as-a-Service for Big-Data Processing☆18Aug 24, 2022Updated 3 years ago
- Full Stack Data Science projects centered around Apache Spark Streaming for educational purpose.☆19May 1, 2023Updated 2 years ago
- A modern, enterprise-ready business intelligence web application☆33Dec 9, 2022Updated 3 years ago
- Stackable Operator for Apache Airflow☆32Updated this week
- Edit code in IntelliJ, eval/run in Zeppelin notebook☆18Mar 17, 2019Updated 6 years ago
- ☆24Dec 20, 2022Updated 3 years ago
- The official repository for the Rock the JVM Spark Optimization with Scala course☆58Dec 4, 2023Updated 2 years ago
- Fine-tuning LLMs on Flyte and Union Cloud☆30Dec 1, 2023Updated 2 years ago
- ☆13Feb 15, 2025Updated last year
- Example repo to create end to end tests for data pipeline.☆25Jun 14, 2024Updated last year
- Docker envinroment to stream data from Kafka to Iceberg tables☆30Feb 27, 2024Updated last year
- A set of tools that make working with the Scala ecosystem even better.☆12Updated this week
- ☆11Oct 6, 2023Updated 2 years ago
- ☆30Jul 2, 2024Updated last year
- Import data from clickhouse to hadoop with pure SQL☆36Mar 19, 2019Updated 6 years ago
- The official repository for the Rock the JVM Flink course☆31Dec 30, 2025Updated last month
- ☆35Feb 8, 2022Updated 4 years ago
- Denoising GANs -- TensorFlow2 training code for Gaussian denoiser using the GAN framework.☆10Jan 6, 2022Updated 4 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Feb 1, 2026Updated 2 weeks ago
- This Guidance demonstrates how to transform architecture diagrams into Infrastructure as Code (IaC) templates using AI, addressing the ch…☆37Updated this week
- Utility functions to support analytics over FHIR in BigQuery or Apache Spark☆14Jan 8, 2024Updated 2 years ago
- AQIPython is a Python module that calculates the Air Quality Index (AQI) for various air pollutants based on different standards.☆10Mar 5, 2024Updated last year
- Repository for the dbt Semantic Layer course☆11Nov 13, 2025Updated 3 months ago
- Spark in Kubernetes☆39Jun 3, 2024Updated last year
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents.☆70Jun 16, 2025Updated 8 months ago