A series of Jupyter notebooks that walk you through machine learning with the Apache Spark ecosystem using Spark MLlib, PyTorch, and TensorFlow.
☆86 · Oct 12, 2023 · Updated 2 years ago
Alternatives and similar repositories for ml-with-apache-spark
Users that are interested in ml-with-apache-spark are comparing it to the libraries listed below.
- Scaling Machine Learning in Three Week course in a collaboration with O'Reilly following the guidance of Adi Polak's book - Scaling Machi… ☆24 · May 12, 2023 · Updated 3 years ago
- An example of using Torch rust bindings to serve trained machine learning models via Actix Web ☆17 · Aug 15, 2021 · Updated 4 years ago
- ☆11 · Jun 17, 2024 · Updated last year
- lakeFS airflow operator ☆28 · Oct 23, 2023 · Updated 2 years ago
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,… ☆11 · Jun 27, 2023 · Updated 2 years ago
- Hands-On GPU Computing with Python, published by Packt ☆32 · Jan 30, 2023 · Updated 3 years ago
- ☆14 · Sep 9, 2024 · Updated last year
- ☆28 · Feb 4, 2026 · Updated 3 months ago
- A unit test framework for Databricks notebooks ☆12 · Dec 8, 2020 · Updated 5 years ago
- An IoT Edge Module that generates sample data using [Bogus](https://github.com/bchavez/Bogus) ☆10 · Dec 8, 2022 · Updated 3 years ago
- ☆12 · Jun 25, 2024 · Updated last year
- Hybrid architecture media server, media service and Streamlit client app using FastAPI and Python ☆14 · Jul 12, 2022 · Updated 3 years ago
- Streamlit Cookbook, published by Packt ☆14 · Jun 6, 2025 · Updated 11 months ago
- Full Machine Learning Lifecycle using Airflow, MLflow, and AWS S3 ☆26 · Mar 28, 2023 · Updated 3 years ago
- a simple lakeFS webhook for pre-commit and pre-merge validation of data objects ☆13 · Nov 9, 2023 · Updated 2 years ago
- Supports the easy and robust publication of technical books with Asciidoctor ☆15 · Apr 12, 2026 · Updated last month
- Digital Transformation and Modernization with IBM API Connect, published by Packt ☆12 · Jan 30, 2023 · Updated 3 years ago
- A Python client for the Enigma API. ☆14 · Dec 7, 2022 · Updated 3 years ago
- the structure my blog uses ☆15 · Dec 4, 2022 · Updated 3 years ago
- Hands-on Learning with KubeFlow + Keras/TensorFlow 2.0 + TF Extended (TFX) + Kubernetes + Airflow + Jupyter ☆11 · Oct 28, 2022 · Updated 3 years ago
- Implement different variants of gradient descent in python using numpy ☆11 · Apr 23, 2019 · Updated 7 years ago
- ☆16 · Oct 21, 2024 · Updated last year
- AI enhanced automation tool for financial modelling and market analysis. ☆12 · Sep 10, 2019 · Updated 6 years ago
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin… ☆11 · Mar 9, 2022 · Updated 4 years ago
- Developer for Life Blog ☆16 · Sep 2, 2021 · Updated 4 years ago
- Teaching notes from my Advanced SQL workshops as local lead instructor at General Assembly New York. The first edition was created for th… ☆18 · Feb 14, 2020 · Updated 6 years ago
- Using MLflow with a Docker Environment ☆19 · Sep 17, 2020 · Updated 5 years ago
- ☆47 · Mar 26, 2026 · Updated 2 months ago
- Time Series Analysis with Python Cookbook, Second Edition - Published by Packt ☆75 · Feb 12, 2026 · Updated 3 months ago
- GitHub Repository for Azure AI-102 Essentials to Learn, Implement, and Certify ☆35 · Feb 11, 2026 · Updated 3 months ago
- Choose features that promote diversity and strong relationships to the target variable. ☆18 · Apr 24, 2026 · Updated last month
- A Scala library for Firestore in Datastore mode ☆13 · Jun 11, 2024 · Updated last year
- Multi-factor Risk Models of Asset or Portfolio Returns ☆10 · May 4, 2021 · Updated 5 years ago
- Google Cloud Storage Python Client ☆14 · Dec 26, 2022 · Updated 3 years ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio. ☆53 · Oct 31, 2023 · Updated 2 years ago
- In this notebook, we will create an AI and time-series-driven forecasting engine based on a set of 5 AI models and 5 time series models an… ☆14 · Jun 12, 2021 · Updated 4 years ago
- ☆18 · Nov 27, 2020 · Updated 5 years ago
- Wrapper for SurveyGizmo's restful API service ☆16 · Sep 24, 2020 · Updated 5 years ago
- Predict if a reservation will be canceled using robust Machine Learning pipelines with Airflow and Mlflow ☆66 · Jan 12, 2024 · Updated 2 years ago