A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlow.
☆86Oct 12, 2023Updated 2 years ago
Alternatives and similar repositories for ml-with-apache-spark
Users that are interested in ml-with-apache-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Scaling Machine Learning in Three Week course in a collaboration with O'Reilly following the guidance of Adi Polak's book - Scaling Machi…☆24May 12, 2023Updated 2 years ago
- A collection of simple python mini projects to enhance your python skills☆18Feb 18, 2022Updated 4 years ago
- ☆11Jun 17, 2024Updated last year
- lakeFS airflow operator☆27Oct 23, 2023Updated 2 years ago
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Jun 27, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14Sep 9, 2024Updated last year
- 🦀 Rust server running in a Docker container deployed to AWS ECS via Terraform 🚀☆12Dec 31, 2024Updated last year
- PostgreSQL + Grafana with test data running in Docker Compose. This is the repo used for the talk I gave at PostgresConf NYC 2019.☆10Sep 16, 2021Updated 4 years ago
- Resources backing the Feast fraud tutorial on GCP☆14May 31, 2022Updated 3 years ago
- High Performance with Java, published by Packt☆15Jul 18, 2024Updated last year
- ☆10Jul 17, 2023Updated 2 years ago
- Profiling Spark Applications for Performance Comparison and Diagnosis☆17Nov 11, 2018Updated 7 years ago
- Full Machine Learning Lifecycle using Airflow, MLflow, and AWS S3☆26Mar 28, 2023Updated 3 years ago
- simple ansible playbook to take clean ubuntu 18.04 to CUDA 10, PyTorch 1.0, fastai, miniconda heaven☆12Dec 16, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- CYRULIK – official font repository of Warsaw Types project☆11Jul 23, 2025Updated 8 months ago
- Digital Transformation and Modernization with IBM API Connect, published by Packt☆12Jan 30, 2023Updated 3 years ago
- Customized Jupyter Spark Docker images with everything you need☆16May 3, 2025Updated 11 months ago
- Discover Bluemix, IBM Cloud Platform, through a set of hands-on labs.☆12Feb 13, 2024Updated 2 years ago
- Куски кода и приемы, которые часто переиспользую☆16Jan 3, 2024Updated 2 years ago
- An ANN-LSTM based Model for Learning Individual Customer Behavior in Response to Electricity Prices☆11Mar 27, 2020Updated 6 years ago
- Classifying products into product categories with the fastText machine learning library. (Python)☆12Sep 5, 2020Updated 5 years ago
- GitHub Repository for Azure AI-102 Essentials to Learn, Implement, and Certify☆33Feb 11, 2026Updated 2 months ago
- MPC Server for PySpark inpired by the LakeSail☆18Feb 26, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- Example of packaging a Rust web application using Docker☆18Sep 2, 2020Updated 5 years ago
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…☆11Mar 9, 2022Updated 4 years ago
- Using MLflow with a Docker Environment☆19Sep 17, 2020Updated 5 years ago
- Time Series Analysis with Python Cookbook, Second Edition - Published by Packt☆74Feb 12, 2026Updated 2 months ago
- ☆14May 15, 2024Updated last year
- A Scala library for Firestore in Datastore mode☆13Jun 11, 2024Updated last year
- a tor socks proxy docker image☆12Apr 8, 2026Updated last week
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆53Oct 31, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- In this notebook, we will create an AI and time serie driven forecasting engine based on a set of 5 AI models and 5 time series models an…☆14Jun 12, 2021Updated 4 years ago
- Access to the stringi API from within an Rcpp-based Project☆11Feb 3, 2025Updated last year
- Managing Data as a Product, published by Packt☆20Nov 30, 2024Updated last year
- ☆18Nov 27, 2020Updated 5 years ago
- HTML to Scalatags converter☆10Oct 8, 2018Updated 7 years ago
- Implementing Machine Learning tasks using Tensorflow framework☆16Feb 2, 2018Updated 8 years ago
- Predict if a reservation will be canceled using robust Machine Learning pipelines with Airflow and Mlflow☆66Jan 12, 2024Updated 2 years ago