datastacktv / kubeflow-introduction
Code examples for the Introduction to Kubeflow course
☆13Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for kubeflow-introduction
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online course☆19Updated 4 years ago
- A series of workshop modules introducing Feast feature store.☆19Updated 2 years ago
- PySpark phonetic and string matching algorithms☆35Updated 8 months ago
- Read Delta tables without any Spark☆47Updated 8 months ago
- ☆54Updated 10 months ago
- Projects developed by Domino's R&D team☆76Updated 2 years ago
- Fake Pandas / PySpark DataFrame creator☆43Updated 7 months ago
- Delta reader for the Ray open-source toolkit for building ML applications☆42Updated 9 months ago
- Record matching and entity resolution at scale in Spark☆31Updated last year
- A pyspark lib to validate data quality☆18Updated last year
- Config files for setting up Multitenant Kubeflow on AWS with spot instances☆10Updated 4 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 2 years ago
- Scaling Python Machine Learning☆44Updated last year
- DuckDB with Dashboarding tools demo evidence, streamlit and rill☆12Updated 10 months ago
- Machine Learning Projects with Flytekit☆35Updated last year
- Full stack data engineering tools and infrastructure set-up☆41Updated 3 years ago
- Data validation library for PySpark 3.0.0☆34Updated last year
- ☆17Updated 2 years ago
- Pandas helper functions☆29Updated last year
- Repository for makeinga a GitHub Actions for deploying to Kubeflow.☆35Updated 2 years ago
- This repo is an approach to TDD in machine learning model operation. it covers project structure, testing essentials using pytest with Gi…☆14Updated 3 years ago
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb☆18Updated last year
- This is a repository for the Duke University Cloud Computing course project on Serveless Data Engineering Pipeline. For this project, I r…☆19Updated 3 years ago
- mlctl is the control plane for MLOps. It provides a CLI and a Python SDK for supporting key operations related to MLOps, such as "model t…☆25Updated 3 years ago
- ☆12Updated 4 years ago
- ∞ Priceloop Engineering Conventions for Scala, Python, Git Workflow etc☆102Updated 2 years ago