A Gentle introduction to Machine Learning with Apache Spark
☆11Mar 2, 2026Updated 3 weeks ago
Alternatives and similar repositories for spark-intro-to-ml
Users that are interested in spark-intro-to-ml are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Google Cloud Functions Python Runtime Demo☆12Jul 27, 2018Updated 7 years ago
- Model Context Protocol (MCP) server to interact with gRPC services using the grpcurl tool☆16Mar 5, 2025Updated last year
- Using WASM to write UDFs in Apache Spark☆12Jun 3, 2024Updated last year
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- Docker compose and Google Colab demo to build a CDC with Delta Lake☆15Sep 7, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This is home for the Migration Theme for WordPress - the starting point for your next migration project.☆25May 24, 2013Updated 12 years ago
- ☆16Jun 27, 2020Updated 5 years ago
- BSR's new public API. Currently in development.☆21Jan 26, 2026Updated 2 months ago
- Visits sessionization pipeline used for the talk☆13May 28, 2024Updated last year
- Daily-updated reading list for designing High Scalability , High Availability , High Stability back-end systems - Pull requests are gre…☆15Jul 14, 2022Updated 3 years ago
- Unity Catalog AI Model Context Protocol Server☆16Mar 28, 2025Updated last year
- WIP: Kubernetes Lets Encrypt Tutorial☆27Jul 18, 2016Updated 9 years ago
- ☆11Oct 11, 2022Updated 3 years ago
- THIS PROJECT IS ABOUT TURKISH SENTIMENT ANALYSIS☆14Aug 23, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- JavaScript Script to remove all expired jobs at once☆12Feb 13, 2022Updated 4 years ago
- Freak's Axie Extension☆11Dec 17, 2021Updated 4 years ago
- repo with resources from Understanding Data with Alex Merced videos☆14Jan 20, 2024Updated 2 years ago
- Code that maps Wikipedia contributions by IP address☆16Oct 2, 2024Updated last year
- Dremio Community Connector for HBase☆12Nov 7, 2024Updated last year
- ☆11Nov 8, 2017Updated 8 years ago
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML☆15Dec 24, 2016Updated 9 years ago
- ☆15Aug 21, 2017Updated 8 years ago
- ☆12Oct 16, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- An implementation of apriori algorithm under spark platform☆11Dec 13, 2018Updated 7 years ago
- Jupyter Notebook with Spark support extracted from jupyter/docker-stack☆19Jul 4, 2018Updated 7 years ago
- Sample RESTful API for NodeSchool Workshop☆15Sep 13, 2016Updated 9 years ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆14Apr 14, 2023Updated 2 years ago
- Basic Spark utilities☆13Feb 20, 2025Updated last year
- This lab teaches you how to create a realtime dashboard of stock prices using Hortonworks Data Platform and NiFi☆23Jan 18, 2016Updated 10 years ago
- Collection of notebooks☆17Oct 27, 2024Updated last year
- Writing PySpark logs in Apache Spark and Databricks☆17Jun 13, 2022Updated 3 years ago
- ☆13May 22, 2025Updated 10 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A pyspark lib to validate data quality☆18Nov 11, 2022Updated 3 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…☆16Oct 3, 2025Updated 5 months ago
- API REST boilerplate using Spring Boot and Redis as database☆13Dec 26, 2018Updated 7 years ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Jun 12, 2024Updated last year
- Generate a changelog based on merged pull requests between tagged versions☆15Mar 15, 2018Updated 8 years ago
- Due to lack of resources on how to deploy kafka with simple SASL authentication (just username and password) and how to write producer an…☆12Dec 29, 2021Updated 4 years ago
- Example to create lineage in Atlas with sqoop and spark☆14Apr 5, 2017Updated 8 years ago