datastacktv / kubeflow-introductionLinks
Code examples for the Introduction to Kubeflow course
☆14Updated 4 years ago
Alternatives and similar repositories for kubeflow-introduction
Users that are interested in kubeflow-introduction are comparing it to the libraries listed below
Sorting:
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago
- Read Delta tables without any Spark☆47Updated last year
- Pandas helper functions☆31Updated 2 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Fake Pandas / PySpark DataFrame creator☆48Updated last year
- PySpark phonetic and string matching algorithms☆39Updated last year
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- Machine Learning Projects with Flytekit☆37Updated 2 years ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- Projects developed by Domino's R&D team☆78Updated 3 years ago
- Scaling Python Machine Learning☆49Updated last year
- Content for a talk on "The wonderful world of data quality tools in Python"☆18Updated 4 years ago
- DuckDB with Dashboarding tools demo evidence, streamlit and rill☆19Updated last year
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker☆31Updated 4 years ago
- A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo…☆82Updated last year
- ☆16Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Feast AWS guide using Redshift / Spectrum / DynamoDB to build a credit scoring model☆66Updated 3 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 4 years ago
- ☆12Updated 4 years ago
- real-time data + ML pipeline☆54Updated this week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated 2 weeks ago
- scaffold of Apache Airflow executing Docker containers☆86Updated 2 years ago
- A series of workshop modules introducing Feast feature store.☆19Updated 3 years ago
- MLflow App Library☆79Updated 6 years ago
- ☆30Updated 4 years ago
- Record matching and entity resolution at scale in Spark☆35Updated last year