sbakiu / kubeflow-spark
Orchestrate Spark Jobs from Kubeflow Pipelines and poll for the status.
☆50 · Updated 2 years ago
Related projects:
- A workshop with several modules to help learn Feast, an open-source feature store ☆82 · Updated last week
- The Internals of Spark on Kubernetes ☆71 · Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆40 · Updated 7 months ago
- End-to-end example integrating MLflow and Seldon Core ☆51 · Updated 3 years ago
- Example repo to kickstart integration with MLflow pipelines. ☆73 · Updated last year
- Spark on Kubernetes infrastructure Helm charts repo ☆199 · Updated last year
- Spark on Kubernetes using Helm ☆34 · Updated 4 years ago
- Performance optimization for Spark running on Kubernetes ☆84 · Updated 4 years ago
- Accelerator to rapidly deploy customized features for your business ☆55 · Updated 9 months ago
- ☆54 · Updated 8 months ago
- Big Data Newsletter ☆21 · Updated 5 months ago
- A collection of MLflow examples that you can run directly with the mlflow command ☆30 · Updated 4 years ago
- This repository builds a production-ready Docker image to productionize an MLflow cluster ☆12 · Updated 3 years ago
- Spark on Kubernetes infrastructure Docker images repo ☆37 · Updated last year
- A repository of Helm charts ☆31 · Updated last year
- A series of workshop modules introducing the Feast feature store. ☆19 · Updated 2 years ago
- Feast AWS guide using Redshift / Spectrum / DynamoDB to build a credit scoring model ☆60 · Updated 2 years ago
- Magic to help Spark pipelines upgrade ☆33 · Updated last month
- REST API for Apache Spark on K8S or YARN ☆89 · Updated last week
- A series of Jupyter notebooks that walk you through Machine Learning with the Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo… ☆73 · Updated 11 months ago
- Example for the article "Running Spark 3 with standalone Hive Metastore 3.0" ☆96 · Updated last year
- This repository contains code for Spark Streaming ☆21 · Updated 3 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi… ☆91 · Updated this week
- This repository contains all tutorials for Apache Spark, Delta Lake, Koalas, MLflow, and others. ☆15 · Updated 4 years ago
- A table-format-agnostic data sharing framework ☆36 · Updated 7 months ago
- Kubeflow example of machine learning/model serving ☆35 · Updated 4 years ago
- Kedro plugin to support running workflows on Kubeflow Pipelines ☆49 · Updated 2 weeks ago
- Spark on Kubernetes ☆105 · Updated last year
- JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook ☆90 · Updated last year
- ☆23 · Updated 2 years ago