PacktPublishing / Building-Big-Data-Pipelines-with-Apache-Beam
Building Big Data Pipelines with Apache Beam, published by Packt
☆83Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Building-Big-Data-Pipelines-with-Apache-Beam
- Batch Processing , orchestration using Apache Airflow and Google Workflows, spark structured Streaming and a lot more☆19Updated 2 years ago
- Data Engineering with Spark and Delta Lake☆89Updated last year
- ☆36Updated 2 years ago
- Source code for the YouTube video, Apache Beam Explained in 12 Minutes☆20Updated 4 years ago
- ☆20Updated 5 years ago
- Code snippets for Data Engineering Design Patterns book☆40Updated this week
- ☆127Updated 6 months ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆63Updated 6 months ago
- ☆127Updated this week
- Repository for Beam College sessions☆104Updated 3 years ago
- The source code for the book Modern Data Engineering with Apache Spark☆33Updated 2 years ago
- markup to create labs for courses from the Google Cloud training catalog.☆50Updated 2 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated last year
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆65Updated 2 years ago
- Spark and Delta Lake Workshop☆22Updated 2 years ago
- Dataproc templates and pipelines for solving simple in-cloud data tasks☆119Updated this week
- Materials of the Official Helm Chart Webinar☆27Updated 3 years ago
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Updated 3 years ago
- Spark data pipeline that processes movie ratings data.☆27Updated last week
- Code for my "Efficient Data Processing in SQL" book.☆50Updated 3 months ago
- PySpark data-pipeline testing and CICD☆28Updated 4 years ago
- Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt☆16Updated last year
- The official repository for the Rock the JVM Spark Optimization with Scala course☆55Updated 11 months ago
- The official repository for the Rock the JVM Spark Optimization 2 course☆37Updated 11 months ago
- Spark on Kubernetes using Helm☆34Updated 4 years ago
- Data Engineering with Google Cloud Platform, published by Packt☆109Updated last year
- Public source code for the Batch Processing with Apache Beam (Python) online course☆19Updated 4 years ago
- GCP-Data-Engineer-Study-Guide☆118Updated 5 years ago
- Data Engineering with Databricks Cookbook, published by Packt☆45Updated 5 months ago