gschmutz / stream-processing-workshop
Stream Processing Workshop
☆20Updated last month
Related projects:
- ☆12Updated 11 months ago
- Ingest JSON records from Kafka to multiple tables in the database using the DataStax Apache Kafka Connector☆13Updated last year
- This repository contains recipes for Apache Pinot.☆23Updated 3 weeks ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A Python job will then be submitted to an Apach…☆19Updated 8 years ago
- A curated list of awesome Databricks resources, including Spark☆14Updated 2 months ago
- NiFi Processor for Apache Pulsar☆10Updated 6 months ago
- AWS Big Data Certification☆24Updated last year
- A Data Mesh proof-of-concept built on Confluent Cloud☆2Updated last year
- A PySpark job to handle upserts, convert to Parquet, and create partitions on S3☆26Updated 4 years ago
- Various Demos mostly based on docker environments☆33Updated last year
- ☆2Updated last year
- A plugin for Flask Appbuilder, Keycloak, and Azure AD☆10Updated 2 years ago
- Supplementary material for Building a Modern Data Platform with Snowflake, from Pearson.☆21Updated 2 years ago
- Kafka as your DataLake Demo☆11Updated last year
- KSQL Step-by-step tutorial using the basic functions of Apache Kafka's Streaming SQL Engine☆10Updated 5 years ago
- FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...☆15Updated this week
- This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS …☆17Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated 2 weeks ago
- This is a basic Apache Pinot example for ingesting real-time MySQL change logs using Debezium☆27Updated 3 years ago
- This repository contains a recipe for bootstrapping a climate analysis application using Apache Pinot and Superset☆20Updated 4 years ago
- A Flink application that demonstrates reading and writing to/from Apache Kafka with Apache Flink☆20Updated last year
- Spark package for checking data quality☆25Updated last year
- A boilerplate project for Azure Big Data PaaS services☆14Updated last year
- Mastering Spark for Data Science, published by Packt☆46Updated last year
- Study notes for AWS Big Data Specialty certification☆10Updated 5 years ago
- Kafka Connect Examples☆42Updated last year
- Code for the fictitious food delivery company GottaEat used in the Pulsar In Action book☆17Updated 2 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes.☆24Updated 6 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 9 months ago
- ☆3Updated last year