salcaino / sfucmpt733
SFU CMPT 733 public repo
☆13 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for sfucmpt733
- For Udemy students: the official repository of Rock the JVM's Spark Streaming course ☆25 · Updated last year
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines ☆123 · Updated 2 years ago
- Simple stream processing pipeline ☆91 · Updated 4 months ago
- Spark on Kubernetes using Helm ☆34 · Updated 4 years ago
- Apache Spark Course Material ☆85 · Updated last year
- Apache Spark 3 - Structured Streaming Course Material ☆119 · Updated last year
- The official repository for the Rock the JVM Spark Essentials with Scala course ☆260 · Updated 3 weeks ago
- Code snippets used in demos recorded for the blog. ☆29 · Updated 3 weeks ago
- Materials of the Official Helm Chart Webinar ☆27 · Updated 3 years ago
- Docker with Airflow and Spark standalone cluster ☆244 · Updated last year
- The resources of the preparation course for Databricks Data Engineer Professional certification exam ☆84 · Updated 3 weeks ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆44 · Updated last year
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data ☆40 · Updated 11 months ago
- Apache Spark 3 - Structured Streaming Course Material ☆43 · Updated 4 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment. ☆39 · Updated 3 years ago
- The official repository for the Rock the JVM Spark Optimization with Scala course ☆55 · Updated 11 months ago
- Spark all the ETL Pipelines ☆32 · Updated last year
- Open source stack lakehouse ☆25 · Updated 8 months ago
- This project shows how to capture changes from postgres database and stream them into kafka ☆31 · Updated 5 months ago
- Complete high-quality practice tests of 50 questions each will help you master your Confluent Certified Developer for Apache Kafka (CCDAK… ☆75 · Updated last year
- Dockerizing an Apache Spark Standalone Cluster ☆43 · Updated 2 years ago
- Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consu… ☆66 · Updated 10 months ago
- Different ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink ☆31 · Updated 2 years ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC) ☆43 · Updated last year
- The resources of the preparation course for Databricks Data Engineer Associate certification exam ☆274 · Updated 2 months ago
- ☆22 · Updated 7 months ago
- ☆22 · Updated last year
- ☆9 · Updated 3 years ago