adipolak / ml-with-apache-spark
A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlow.
☆81Updated last year
Alternatives and similar repositories for ml-with-apache-spark:
Users that are interested in ml-with-apache-spark are comparing it to the libraries listed below
- Example repo to kickstart integration with mlflow pipelines.☆74Updated 2 years ago
- ☆23Updated 2 years ago
- ☆27Updated 2 years ago
- Capturing model drift and handling its response - Example webinar☆107Updated 5 years ago
- Scaling Machine Learning in Three Week course in a collaboration with O'Reilly following the guidance of Adi Polak's book - Scaling Machi…☆23Updated last year
- ☆84Updated last year
- An example MLFlow project☆48Updated last month
- ☆30Updated 2 years ago
- A workshop with several modules to help learn Feast, an open-source feature store☆86Updated last month
- Reference code base for ML Engineering, Manning Publications☆126Updated 3 years ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆211Updated last year
- Feast AWS guide using Redshift / Spectrum / DynamoDB to build a credit scoring model☆61Updated 3 years ago
- Template repo for kickstarting recipes for regression use case☆54Updated 2 months ago
- The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.☆52Updated 2 years ago
- [DEPRECATED] Demo repository implementing an end-to-end MLOps workflow on Databricks. Project derived from dbx basic python template☆110Updated 2 years ago
- Resources backing the Feast fraud tutorial on GCP☆14Updated 2 years ago
- A series of workshop modules introducing Feast feature store.☆19Updated 2 years ago
- Spark and Delta Lake Workshop☆22Updated 2 years ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p…☆38Updated 3 years ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 6 months ago
- ☆16Updated last year
- ☆29Updated 4 years ago
- Scaling Python Machine Learning☆45Updated last year
- Code snippets for Data Engineering Design Patterns book☆68Updated 2 weeks ago
- Delta Lake Documentation☆48Updated 8 months ago
- ☆41Updated 7 months ago
- PySpark Cheatsheet☆36Updated 2 years ago
- ☆10Updated 2 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆102Updated 4 years ago
- Data Engineering with Spark and Delta Lake☆95Updated 2 years ago