Beam-College / season-2022
☆36 Updated 2 years ago
Alternatives and similar repositories for season-2022:
Users who are interested in season-2022 are comparing it to the repositories listed below.
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service ☆66 Updated 10 months ago
- Repository for Beam College sessions ☆107 Updated 3 years ago
- Data lake, data warehouse on GCP ☆56 Updated 3 years ago
- Markup to create labs for courses from the Google Cloud training catalog. ☆49 Updated 3 years ago
- ☆84 Updated last year
- Batch processing, orchestration using Apache Airflow and Google Workflows, Spark Structured Streaming, and a lot more ☆19 Updated 2 years ago
- CI/CD pipeline that deploys a dbt image on a GKE cluster ☆11 Updated 3 years ago
- ☆176 Updated last month
- Dataproc templates and pipelines for solving simple in-cloud data tasks ☆124 Updated this week
- Data Engineering with Spark and Delta Lake ☆96 Updated 2 years ago
- Cloud-native, data onboarding architecture for Google Cloud Datasets ☆158 Updated 3 weeks ago
- Data Engineering with Google Cloud Platform, published by Packt ☆113 Updated last year
- Building Big Data Pipelines with Apache Beam, published by Packt ☆86 Updated last year
- Data engineering with dbt, published by Packt ☆76 Updated last year
- The go-to demo for public and private dbt Learn ☆76 Updated 6 months ago
- Code for my "Efficient Data Processing in SQL" book. ☆56 Updated 7 months ago
- This project leverages GCS, Composer, Dataflow, BigQuery, and Looker on Google Cloud Platform (GCP) to build a robust data engineering so… ☆22 Updated last year
- An end-to-end demo of Google's Cloud data and analytics stack. ☆240 Updated last week
- ☆60 Updated last month
- ☆128 Updated 10 months ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/ ☆34 Updated 10 months ago
- ☆53 Updated last month
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆167 Updated last year
- Data Engineering with Google Cloud Platform - Second Edition, published by Packt ☆32 Updated 9 months ago
- ☆134 Updated 3 months ago
- Companion repository for the book 'Delta Lake Up and Running' ☆45 Updated 10 months ago
- Code snippets for Data Engineering Design Patterns book ☆74 Updated last month
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-dataproc ☆48 Updated last year
- ☆34 Updated 7 months ago
- Full stack data engineering tools and infrastructure set-up ☆50 Updated 4 years ago