A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlow.
☆86Oct 12, 2023Updated 2 years ago
Alternatives and similar repositories for ml-with-apache-spark
Users that are interested in ml-with-apache-spark are comparing it to the libraries listed below
Sorting:
- Scaling Machine Learning in Three Week course in a collaboration with O'Reilly following the guidance of Adi Polak's book - Scaling Machi…☆24May 12, 2023Updated 2 years ago
- PostgreSQL + Grafana with test data running in Docker Compose. This is the repo used for the talk I gave at PostgresConf NYC 2019.☆10Sep 16, 2021Updated 4 years ago
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Jun 27, 2023Updated 2 years ago
- Choose features that promote diversity and strong relationships to the target variable.☆16Oct 14, 2025Updated 4 months ago
- Official code for ICML 2024 paper "An Unsupervised Approach for Periodic Source Detection in Time Series"☆13Feb 21, 2025Updated last year
- Customized Jupyter Spark Docker images with everything you need☆16May 3, 2025Updated 10 months ago
- Куски кода и приемы, которые часто переиспользую☆16Jan 3, 2024Updated 2 years ago
- lakeFS airflow operator☆27Oct 23, 2023Updated 2 years ago
- Deploy any Machine Learning model serverless in AWS.☆24Oct 17, 2018Updated 7 years ago
- GitHub Repo for the UChicago, Spring 2021 course *Are We Doomed? Confronting the End of the World*☆12Mar 30, 2021Updated 4 years ago
- Hands-On Deep Learning with Apache Spark, Published by Packt☆31Apr 17, 2023Updated 2 years ago
- A set of tools that make working with the Scala ecosystem even better.☆12Updated this week
- Predict if a reservation will be canceled using robust Machine Learning pipelines with Airflow and Mlflow☆66Jan 12, 2024Updated 2 years ago
- Java library to fulfil the requirement of numpy in java☆22Oct 23, 2024Updated last year
- A Scala library for Firestore in Datastore mode☆13Jun 11, 2024Updated last year
- Global analysis platform for fluorescence data☆12Jan 6, 2026Updated 2 months ago
- PyData Essentials☆34Jul 7, 2017Updated 8 years ago
- Source for the "Making Python 100x faster with less than 100 lines of Rust" blog post☆42Aug 18, 2024Updated last year
- A fun little data analysis project to whether American prefers Mexican food over Italian food or Chinese Food.☆12Sep 11, 2017Updated 8 years ago
- a simple lakeFS webhook for pre-commit and pre-merge validation of data objects☆12Nov 9, 2023Updated 2 years ago
- Stock-keeping-oriented Prediction Error Costs (SPEC)☆12Jul 3, 2020Updated 5 years ago
- PD calibration techniques for LDP portfolios☆10May 29, 2016Updated 9 years ago
- ☆10Jan 23, 2023Updated 3 years ago
- Kafka library with a schema registry integration☆10Dec 16, 2025Updated 2 months ago
- ☆15Sep 7, 2025Updated 6 months ago
- Ejemplo de cómo trabajar con gráficos en Kotlin☆12Sep 29, 2022Updated 3 years ago
- A Gentle Introduction to RAG☆15Oct 8, 2024Updated last year
- Visualize linear programming at https://lpviz.net☆33Jan 20, 2026Updated last month
- ☆14Dec 12, 2024Updated last year
- Part-of-speech tagger for the English language☆10Jul 31, 2018Updated 7 years ago
- Computing the gap statistics from Tibshirani et. al. for various clustering algorithms☆13Nov 10, 2025Updated 3 months ago
- Template for machine learning projects.☆12Jul 22, 2023Updated 2 years ago
- full code written for the Twilio blog https://www.twilio.com/blog/media-file-storage-python-flask-amazon-s3-buckets☆11May 4, 2024Updated last year
- ☆10Apr 18, 2024Updated last year
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…☆11Mar 9, 2022Updated 3 years ago
- After creating and training the smoking detection model using YOLOv5, the next step is to deploy the model. In this project, Flask API an…☆10Mar 1, 2023Updated 3 years ago
- FastPy-RS is a high-performance Python library that provides optimized implementations of common functions using Rust.☆18Aug 19, 2025Updated 6 months ago
- MPC Server for PySpark inpired by the LakeSail☆17Feb 26, 2026Updated last week
- НИС "Методологии разработки ПО", ФКН ВШЭ, Старичков Н.Ю., Крахмалёв Д.С.☆13Mar 12, 2022Updated 3 years ago