newfront / spark-intro-to-ml
A Gentle introduction to Machine Learning with Apache Spark
☆11Updated last year
Related projects: ⓘ
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 9 months ago
- Spark on Kubernetes using Helm☆34Updated 4 years ago
- Source code for the YouTube video, Apache Beam Explained in 12 Minutes☆20Updated 3 years ago
- Kafka Examples repository.☆43Updated 5 years ago
- These are some code examples☆55Updated 4 years ago
- A Data Mesh proof-of-concept built on Confluent Cloud☆2Updated last year
- Basic getting started with Kafka examples☆47Updated 5 years ago
- Spark DataFrame transformation and UDF test examples☆23Updated last year
- AWS Big Data Certification☆24Updated last year
- Magic to help Spark pipelines upgrade☆33Updated last month
- A collection of examples to help show different ways to managing state in Apache Flink☆27Updated 5 years ago
- Spark package for checking data quality☆25Updated last year
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Updated 3 years ago
- The official repository for the Rock the JVM Spark Optimization 2 course☆37Updated 9 months ago
- Scalable CDC Pattern Implemented using PySpark☆18Updated 5 years ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆72Updated last year
- ☆19Updated this week
- Building Big Data Pipelines with Apache Beam, published by Packt☆81Updated last year
- Real-world Spark pipelines examples☆83Updated 6 years ago
- An example Apache Beam project.☆111Updated 7 years ago
- Interactive Notebooks that support the book☆38Updated 3 years ago
- Apache Beam starter repo for Python☆17Updated last week
- Supplementary material for Building a Modern Data Platform with Snowflake, from Pearson.☆21Updated 2 years ago
- Examples for High Performance Spark☆15Updated 3 weeks ago
- ☆1Updated last year
- Make your libraries magically appear in Databricks.☆46Updated last year
- Materials (slides and code) for Kafka and Kafka Streams Workshops☆60Updated 3 months ago
- Code snippets used in demos recorded for the blog.☆28Updated 5 months ago
- The Internals of Spark on Kubernetes☆71Updated 2 years ago
- Extensible streaming ingestion pipeline on top of Apache Spark☆43Updated 5 months ago