PacktPublishing / Building-Big-Data-Pipelines-with-Apache-Beam
Building Big Data Pipelines with Apache Beam, published by Packt
☆86Updated last year
Alternatives and similar repositories for Building-Big-Data-Pipelines-with-Apache-Beam:
Users that are interested in Building-Big-Data-Pipelines-with-Apache-Beam are comparing it to the libraries listed below
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆66Updated 10 months ago
- Repository for Beam College sessions☆107Updated 3 years ago
- ☆36Updated 2 years ago
- ☆128Updated 10 months ago
- Batch Processing , orchestration using Apache Airflow and Google Workflows, spark structured Streaming and a lot more☆19Updated 2 years ago
- ☆134Updated 4 months ago
- Code snippets for Data Engineering Design Patterns book☆74Updated last month
- Source code for the YouTube video, Apache Beam Explained in 12 Minutes☆21Updated 4 years ago
- ☆52Updated 7 months ago
- The source code for the book Modern Data Engineering with Apache Spark☆35Updated 2 years ago
- Data Engineering with Spark and Delta Lake☆96Updated 2 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago
- Interactive Notebooks that support the book☆40Updated 4 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆42Updated 2 years ago
- Apache Beam examples for running on Google Cloud Dataflow.☆30Updated 6 years ago
- Stream Avro SpecificRecord objects in BigQuery using Cloud Dataflow☆13Updated 3 years ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆65Updated 3 years ago
- The official repository for the Rock the JVM Spark Optimization 2 course☆38Updated last year
- GCP-Data-Engineer-Study-Guide☆119Updated 5 years ago
- Apache Spark 3 - Structured Streaming Course Material☆45Updated 4 years ago
- Dataproc templates and pipelines for solving simple in-cloud data tasks☆124Updated last week
- ☆20Updated 5 years ago
- Spark Examples☆125Updated 3 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆33Updated 5 years ago
- ☆71Updated 2 months ago
- The official repository for the Rock the JVM Spark Optimization with Scala course☆57Updated last year
- Spark on Kubernetes using Helm☆34Updated 4 years ago
- Sample code with integration between Data Catalog and Hive data source.☆25Updated last month
- Weekly Data Engineering Newsletter☆94Updated 8 months ago
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Updated 3 years ago