PacktPublishing / Building-Big-Data-Pipelines-with-Apache-Beam
Building Big Data Pipelines with Apache Beam, published by Packt
☆86Updated 2 years ago
Alternatives and similar repositories for Building-Big-Data-Pipelines-with-Apache-Beam:
Users that are interested in Building-Big-Data-Pipelines-with-Apache-Beam are comparing it to the libraries listed below
- Interactive Notebooks that support the book☆40Updated 4 years ago
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Updated 3 years ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆69Updated last year
- ☆137Updated 5 months ago
- ☆53Updated 9 months ago
- Data Engineering with Spark and Delta Lake☆98Updated 2 years ago
- ☆20Updated 5 years ago
- The official repository for the Rock the JVM Spark Optimization 2 course☆39Updated last year
- The source code for the book Modern Data Engineering with Apache Spark☆36Updated 2 years ago
- Batch Processing , orchestration using Apache Airflow and Google Workflows, spark structured Streaming and a lot more☆19Updated 2 years ago
- Dataproc templates and pipelines for solving in-cloud data tasks☆128Updated last month
- Spark Examples☆125Updated 3 years ago
- Repository for Beam College sessions☆107Updated 4 years ago
- ☆36Updated 2 years ago
- Spark on Kubernetes using Helm☆34Updated 4 years ago
- Stream Avro SpecificRecord objects in BigQuery using Cloud Dataflow☆13Updated 3 years ago
- The official repository for the Rock the JVM Spark Optimization with Scala course☆57Updated last year
- Apache Spark Interview Question and Answers☆20Updated 4 years ago
- Spark and Delta Lake Workshop☆22Updated 2 years ago
- ☆128Updated last year
- Build a real-time website analytics dashboard on GCP using Dataflow, Cloud Memorystore (Redis) and Spring Boot☆28Updated 2 months ago
- Materials of the Official Helm Chart Webinar☆27Updated 3 years ago
- Sample code with integration between Data Catalog and Hive data source.☆25Updated 3 months ago
- Cloud Dataproc: Samples and Utils☆11Updated 4 years ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture☆67Updated last week
- Apache Spark Course Material☆89Updated 2 years ago
- markup to create labs for courses from the Google Cloud training catalog.☆49Updated 3 years ago
- Apache Beam Python examples and templates.☆14Updated 2 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆33Updated 5 years ago
- Code snippets used in demos recorded for the blog.☆37Updated last week