A book describing how to set up and maintain Data Engineering infrastructure using Google Cloud Platform.
☆125Feb 22, 2021Updated 5 years ago
Alternatives and similar repositories for data_engineering_on_gcp_book
Users that are interested in data_engineering_on_gcp_book are comparing it to the libraries listed below
Sorting:
- Watch "A New Hope" in your terminal☆12May 4, 2019Updated 6 years ago
- Build Your Own Roadmap☆11Jul 8, 2020Updated 5 years ago
- Accumulated knowledge and experience in the field of Data Engineering☆871Nov 22, 2022Updated 3 years ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Aug 16, 2020Updated 5 years ago
- Exploring and demonstrating OpenAI's Swarm framework☆20Oct 20, 2024Updated last year
- Toggling and ramping features via a lightweight Redis backend.☆18Sep 26, 2019Updated 6 years ago
- Repositório dedicado a Workshop de Data Lakehouse com Delta Lake☆17Dec 6, 2021Updated 4 years ago
- Open data projects, including real-time and reusable data for local tech meetups, events, and map layers.☆17Aug 10, 2024Updated last year
- Python Threading Jump-Start☆20Aug 10, 2022Updated 3 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆18Sep 17, 2018Updated 7 years ago
- Airflow ETL for Meetup API☆45Dec 27, 2018Updated 7 years ago
- Materials for the Advanced Data Analysis Techniques with Apache Spark mini-course☆27Dec 16, 2017Updated 8 years ago
- This repository contains all the python projects done as a tutorial☆26Mar 27, 2025Updated 11 months ago
- using tensorflow with r☆25Nov 26, 2019Updated 6 years ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆75May 3, 2024Updated last year
- A curated collection of publicly available resources on dbt best practices and how data-driven organizations around the world utilize dbt☆115Feb 28, 2022Updated 4 years ago
- Introduccion al Aprendizaje Automatico - 1er cuatrimestre 2023☆10Jun 8, 2023Updated 2 years ago
- This repo has many shell script scenarios☆36Nov 1, 2022Updated 3 years ago
- Creates a standardised output of the contents of A GTM account☆10Jan 20, 2020Updated 6 years ago
- Simple ETL pipeline using Python☆29May 22, 2023Updated 2 years ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆132Jul 8, 2022Updated 3 years ago
- A list of useful resources to learn Data Engineering from scratch☆3,960Jun 19, 2024Updated last year
- The Data Engineering Cookbook☆14,977Jan 17, 2026Updated last month
- Data Engineering on Google Cloud Platform☆379Jul 29, 2024Updated last year
- Metadata Driven Development (m3d) is a cloud and platform agnostic framework for the automated creation, management and governance of dat…☆34May 23, 2023Updated 2 years ago
- Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme…☆1,831Aug 26, 2022Updated 3 years ago
- Simple python script that converts all Excel files (xls, xlsx, xlsm, csv) in a directory into xlsb files.☆10Mar 13, 2023Updated 2 years ago
- Russian words synonyms and antonyms☆11Dec 7, 2021Updated 4 years ago
- Framework for studying cryptographic hash functions using SAT.☆10Dec 21, 2021Updated 4 years ago
- Bitcoin Improvement Proposals☆17Mar 2, 2026Updated last week
- Python library for the simulation of probabilistic circuits.☆11Feb 1, 2026Updated last month
- Automated Continuous Data Quality Measurement☆12Nov 15, 2023Updated 2 years ago
- Data Engineering with AWS, Published by Packt☆339Mar 2, 2026Updated last week
- Apache Spark Guide☆35Feb 1, 2022Updated 4 years ago
- ☆96Mar 16, 2023Updated 2 years ago
- This is a list of YAML file examples for Docker, Kubernetes, Ansible. Also includes a Python script.☆10Jan 12, 2021Updated 5 years ago
- Analyzing the most strategic words to guess on Wordle, based on letter frequency distributions☆11Feb 20, 2022Updated 4 years ago
- Construction of CNN model for detection of pneumonia in x-rays from scratch☆10Jun 23, 2021Updated 4 years ago
- CSC 424 Advanced Database Management Systems☆16Jan 1, 2020Updated 6 years ago