adipolak / ml-with-apache-spark
A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlow.
☆81Updated last year
Alternatives and similar repositories for ml-with-apache-spark:
Users that are interested in ml-with-apache-spark are comparing it to the libraries listed below
- Scaling Machine Learning in Three Week course in a collaboration with O'Reilly following the guidance of Adi Polak's book - Scaling Machi…☆23Updated last year
- ☆84Updated 2 years ago
- Example repo to kickstart integration with mlflow pipelines.☆76Updated 2 years ago
- Data Engineering with Spark and Delta Lake☆97Updated 2 years ago
- Reference code base for ML Engineering, Manning Publications☆128Updated 3 years ago
- A workshop with several modules to help learn Feast, an open-source feature store☆88Updated 3 months ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆213Updated last year
- An example MLFlow project☆48Updated 3 months ago
- Resources backing the Feast fraud tutorial on GCP☆14Updated 2 years ago
- Machine Learning Engineering with MLflow, published by Packt☆115Updated 9 months ago
- Code samples for the Effective Data Science Infrastructure book☆115Updated last year
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 2 years ago
- A series of workshop modules introducing Feast feature store.☆19Updated 2 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆37Updated 9 months ago
- Deploy A/B testing infrastructure in a containerized microservice architecture for Machine Learning applications.☆40Updated 3 months ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆90Updated 3 years ago
- Essential PySpark for Scalable Data Analytics, published by Packt☆44Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆50Updated 4 years ago
- [DEPRECATED] Demo repository implementing an end-to-end MLOps workflow on Databricks. Project derived from dbx basic python template☆112Updated 2 years ago
- ☆27Updated 2 years ago
- Capturing model drift and handling its response - Example webinar☆108Updated 5 years ago
- Template repo for kickstarting recipes for regression use case☆54Updated 4 months ago
- Interactive Notebooks that support the book☆40Updated 4 years ago
- Scaling Python Machine Learning☆45Updated last year
- Because its never late to start taking notes and 'public' it...☆59Updated 5 months ago
- Feast AWS guide using Redshift / Spectrum / DynamoDB to build a credit scoring model☆63Updated 3 years ago
- ☆39Updated 3 years ago
- Code repository for the "PySpark in Action" book☆196Updated 2 years ago
- Fake Pandas / PySpark DataFrame creator☆46Updated last year
- Spark and Delta Lake Workshop☆22Updated 2 years ago