Orchestrate Spark Jobs from Kubeflow Pipelines and poll for the status.
☆53May 26, 2022Updated 3 years ago
Alternatives and similar repositories for kubeflow-spark
Users who are interested in kubeflow-spark are comparing it to the libraries listed below.
- Magic to help Spark pipelines upgrade☆34Sep 29, 2024Updated last year
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 2 years ago
- Processing videos on Apache Spark☆12Feb 14, 2022Updated 4 years ago
- Image building contents for running Spark standalone on Kubernetes☆16Apr 10, 2020Updated 5 years ago
- Mock streaming data generator☆17May 31, 2024Updated last year
- Dense or Sparse : Optimal SPMM-as-a-Service for Big-Data Processing☆18Aug 24, 2022Updated 3 years ago
- A modern, enterprise-ready business intelligence web application☆33Dec 9, 2022Updated 3 years ago
- Stackable Operator for Apache Airflow☆32Updated this week
- Edit code in IntelliJ, eval/run in Zeppelin notebook☆18Mar 17, 2019Updated 6 years ago
- ☆24Dec 20, 2022Updated 3 years ago
- The official repository for the Rock the JVM Spark Optimization with Scala course☆58Dec 4, 2023Updated 2 years ago
- Fine-tuning LLMs on Flyte and Union Cloud☆30Dec 1, 2023Updated 2 years ago
- Serverless Python with Ray☆59Oct 14, 2022Updated 3 years ago
- Docker environment to stream data from Kafka to Iceberg tables☆30Feb 27, 2024Updated 2 years ago
- Spark: Python study notes☆11Sep 25, 2018Updated 7 years ago
- ☆11Oct 6, 2023Updated 2 years ago
- ☆14Feb 15, 2025Updated last year
- Dockerized monitoring stack for Apache Airflow☆36Sep 8, 2024Updated last year
- ☆30Jul 2, 2024Updated last year
- Import data from clickhouse to hadoop with pure SQL☆36Mar 19, 2019Updated 6 years ago
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.☆3,109Updated this week
- ☆35Feb 8, 2022Updated 4 years ago
- This Guidance demonstrates how to transform architecture diagrams into Infrastructure as Code (IaC) templates using AI, addressing the ch…☆39Mar 1, 2026Updated last week
- Breast cancer data mining with Python sklearn☆11Apr 13, 2019Updated 6 years ago
- Java library to fulfil the requirement of numpy in java☆22Oct 23, 2024Updated last year
- Utility functions to support analytics over FHIR in BigQuery or Apache Spark☆15Jan 8, 2024Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated this week
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents.☆70Feb 20, 2026Updated 2 weeks ago
- A typing game for young children aged 2 to 6☆10May 29, 2020Updated 5 years ago
- ☆15Apr 23, 2025Updated 10 months ago
- Repository for the dbt Semantic Layer course☆12Updated this week
- TV Control API specification - https://w3c.github.io/tvcontrol-api/☆10Jan 28, 2019Updated 7 years ago
- The official repository for the Rock the JVM Spark Optimization 2 course☆43Dec 4, 2023Updated 2 years ago
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!☆42Jan 19, 2026Updated last month
- Apache Hive Metastore as a Standalone server in Docker☆80Aug 22, 2024Updated last year
- This repository contains NiFi processors for interacting with Snowflake Cloud Data Platform.☆12Dec 13, 2024Updated last year
- Crash and burn the Gibson to take out the Da Vinci virus☆12Dec 20, 2020Updated 5 years ago
- A docker image for HDFS FileBrowser. Cloudera Hue with FileBrowser only.☆11Sep 20, 2018Updated 7 years ago