newfront / spark-intro-to-ml
A Gentle Introduction to Machine Learning with Apache Spark
☆11 Updated 2 years ago
Alternatives and similar repositories for spark-intro-to-ml:
Users who are interested in spark-intro-to-ml are comparing it to the repositories listed below.
- Spark on Kubernetes using Helm ☆34 Updated 4 years ago
- Magic to help Spark pipelines upgrade ☆34 Updated 6 months ago
- Real-world Spark pipeline examples ☆83 Updated 7 years ago
- Flowchart for debugging Spark applications ☆105 Updated 6 months ago
- The official repository for the Rock the JVM Spark Optimization 2 course ☆38 Updated last year
- Supplementary material for Building a Modern Data Platform with Snowflake, from Pearson. ☆21 Updated 3 years ago
- These are some code examples ☆55 Updated 5 years ago
- Kafka Examples repository. ☆44 Updated 6 years ago
- Materials of the Official Helm Chart Webinar ☆27 Updated 3 years ago
- Snowflake Kafka Connector (Sink Connector) ☆151 Updated this week
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR ☆66 Updated 3 years ago
- A collection of examples showing different ways to manage state in Apache Flink ☆27 Updated 6 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS ☆44 Updated 2 years ago
- ☆73 Updated 2 months ago
- Educational notes, hands-on problems w/ solutions for the Hadoop ecosystem ☆87 Updated 6 years ago
- Spark with Scala example projects ☆34 Updated 5 years ago
- How to manage Slowly Changing Dimensions with Apache Hive ☆55 Updated 5 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform ☆47 Updated 3 months ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub ☆37 Updated 7 years ago
- ☆22 Updated last year
- ☆20 Updated 5 years ago
- Basic getting started with Kafka examples ☆47 Updated 6 years ago
- ☆198 Updated last year
- CICD pipeline that deploys a dbt image on a GKE cluster ☆11 Updated 3 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/) which creates a concu… ☆75 Updated 6 years ago
- Examples for High Performance Spark ☆15 Updated 5 months ago
- Source code for the YouTube video, Apache Beam Explained in 12 Minutes ☆21 Updated 4 years ago
- Interactive Notebooks that support the book ☆40 Updated 4 years ago
- This project describes how to write a full ETL data pipeline using Spark. ☆15 Updated 2 years ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog ☆35 Updated last year