Nunie123 / data_engineering_on_gcp_bookView external linksLinks
A book describing how to set up and maintain Data Engineering infrastructure using Google Cloud Platform.
☆126Feb 22, 2021Updated 4 years ago
Alternatives and similar repositories for data_engineering_on_gcp_book
Users that are interested in data_engineering_on_gcp_book are comparing it to the libraries listed below
Sorting:
- Watch "A New Hope" in your terminal☆12May 4, 2019Updated 6 years ago
- Examples of causality maps for time series driven by GitHub actions☆15Nov 3, 2023Updated 2 years ago
- Accumulated knowledge and experience in the field of Data Engineering☆871Nov 22, 2022Updated 3 years ago
- Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database☆14Oct 26, 2021Updated 4 years ago
- Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt☆16Jan 30, 2023Updated 3 years ago
- Example of how to import python native modules into a Databricks Notebook☆15Jul 18, 2019Updated 6 years ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Aug 16, 2020Updated 5 years ago
- Open data projects, including real-time and reusable data for local tech meetups, events, and map layers.☆17Aug 10, 2024Updated last year
- Repositório dedicado a Workshop de Data Lakehouse com Delta Lake☆17Dec 6, 2021Updated 4 years ago
- Toggling and ramping features via a lightweight Redis backend.☆18Sep 26, 2019Updated 6 years ago
- Exploring and demonstrating OpenAI's Swarm framework☆20Oct 20, 2024Updated last year
- Example end to end data engineering project.☆1,384Dec 8, 2022Updated 3 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆90Feb 11, 2021Updated 5 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆18Sep 17, 2018Updated 7 years ago
- Python Multiprocessing Jump-Start☆23Jul 29, 2022Updated 3 years ago
- Airflow ETL for Meetup API☆45Dec 27, 2018Updated 7 years ago
- Materials for the Advanced Data Analysis Techniques with Apache Spark mini-course☆27Dec 16, 2017Updated 8 years ago
- using tensorflow with r☆25Nov 26, 2019Updated 6 years ago
- A curated collection of publicly available resources on dbt best practices and how data-driven organizations around the world utilize dbt☆115Feb 28, 2022Updated 3 years ago
- This repo has many shell script scenarios☆36Nov 1, 2022Updated 3 years ago
- Creates a standardised output of the contents of A GTM account☆10Jan 20, 2020Updated 6 years ago
- ☆11Aug 12, 2022Updated 3 years ago
- Simple ETL pipeline using Python☆29May 22, 2023Updated 2 years ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆132Jul 8, 2022Updated 3 years ago
- Metadata Driven Development (m3d) is a cloud and platform agnostic framework for the automated creation, management and governance of dat…☆34May 23, 2023Updated 2 years ago
- Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme…☆1,814Aug 26, 2022Updated 3 years ago
- Data engineering interviews Q&A for data community by data community☆66Jun 7, 2020Updated 5 years ago
- Bitcoin Improvement Proposals☆17Sep 4, 2025Updated 5 months ago
- Python library for the simulation of probabilistic circuits.☆11Feb 1, 2026Updated 2 weeks ago
- Framework for studying cryptographic hash functions using SAT.☆10Dec 21, 2021Updated 4 years ago
- Data Engineering with AWS, Published by Packt☆337Apr 26, 2023Updated 2 years ago
- Apache Spark Guide☆35Feb 1, 2022Updated 4 years ago
- Data Engineering on GCP☆41Oct 20, 2022Updated 3 years ago
- ☆95Mar 16, 2023Updated 2 years ago
- ChRIS data and compute CONtroller☆11Dec 4, 2025Updated 2 months ago
- Demo Application with DataSUS death records and Streamlit☆11Dec 14, 2019Updated 6 years ago
- Construction of CNN model for detection of pneumonia in x-rays from scratch☆10Jun 23, 2021Updated 4 years ago
- CSC 424 Advanced Database Management Systems☆16Jan 1, 2020Updated 6 years ago
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…☆11Mar 9, 2022Updated 3 years ago