A book describing how to set up and maintain Data Engineering infrastructure using Google Cloud Platform.
☆125Feb 22, 2021Updated 5 years ago
Alternatives and similar repositories for data_engineering_on_gcp_book
Users who are interested in data_engineering_on_gcp_book are comparing it to the libraries listed below.
- Build Your Own Roadmap☆11Jul 8, 2020Updated 5 years ago
- Accumulated knowledge and experience in the field of Data Engineering☆869Nov 22, 2022Updated 3 years ago
- Airflow ETL for Meetup API☆45Dec 27, 2018Updated 7 years ago
- Duke MIDS: Data Engineering and DataOps Course☆70Jan 10, 2025Updated last year
- A fully-featured multi-source data pipeline for continuously extracting knowledge from COVID-19 data.☆21Jun 20, 2021Updated 4 years ago
- Toggling and ramping features via a lightweight Redis backend.☆18Sep 26, 2019Updated 6 years ago
- A terraform provider for Lightdash☆16Updated this week
- For over a year now, everything about my professional life has been around Google Cloud. This repo is a repercussion of my disastrous Goo…☆12Nov 17, 2019Updated 6 years ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆75May 3, 2024Updated last year
- Using TensorFlow with R☆25Nov 26, 2019Updated 6 years ago
- This is a record of all my coding practice including Data Manipulation, Data Structure and Algorithm, Data Visualization.☆13Apr 7, 2020Updated 6 years ago
- Example end to end data engineering project.☆1,412Dec 8, 2022Updated 3 years ago
- ☆18May 23, 2024Updated last year
- Streaming eight subreddits from the Reddit API using a Kafka producer and Spark Structured Streaming.☆19Apr 22, 2026Updated last week
- Exploring and demonstrating OpenAI's Swarm framework☆20Oct 20, 2024Updated last year
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆18Sep 17, 2018Updated 7 years ago
- For my IBM Data Science Professional certificate capstone project in early 2020, I used pandas, the FourSquare API, Folium, and other Pyt…☆13Dec 31, 2020Updated 5 years ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Aug 16, 2020Updated 5 years ago
- Python Threading Jump-Start☆20Aug 10, 2022Updated 3 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆89Feb 11, 2021Updated 5 years ago
- A curated collection of publicly available resources on dbt best practices and how data-driven organizations around the world utilize dbt☆116Feb 28, 2022Updated 4 years ago
- The Data Engineering Cookbook☆15,063Jan 17, 2026Updated 3 months ago
- Iowa House Prices Kaggle (top 5%)☆15Jun 17, 2024Updated last year
- A list of useful resources to learn Data Engineering from scratch☆3,987Jun 19, 2024Updated last year
- A few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme…☆1,903Aug 26, 2022Updated 3 years ago
- Video encoding & classification using tensorflow 2.0☆10Nov 12, 2019Updated 6 years ago
- LinkedIn Learning - Advanced SQL Series☆69Jul 28, 2024Updated last year
- Simple ETL pipeline using Python☆29May 22, 2023Updated 2 years ago
- Sample Airflow DAGs to load data from the CovidTracking API to Snowflake via an AWS S3 intermediary.☆16Jan 6, 2021Updated 5 years ago
- Python Multiprocessing Jump-Start☆23Jul 29, 2022Updated 3 years ago
- I developed an ETL project over files from different sources (CSV, JSON). Then I used FastAPI to create an API that allows …☆10Dec 9, 2022Updated 3 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆34Oct 14, 2019Updated 6 years ago
- The goal of this repo is to provide solutions to Zindi hackathons☆15Feb 4, 2022Updated 4 years ago
- Material for teaching vtk python☆17Jul 21, 2015Updated 10 years ago
- This program provides the skills you need to advance your career in data engineering and recommends training to support your preparation …☆21Aug 10, 2022Updated 3 years ago
- May the Fourth Wookieepedia data analysis (topic modeling / network analysis)☆24May 3, 2021Updated 4 years ago
- A repository containing links to some of my Kaggle starter notebooks☆11Jun 1, 2020Updated 5 years ago
- Full stack data engineering tools and infrastructure set-up☆58Feb 13, 2021Updated 5 years ago
- Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.☆351Jan 12, 2022Updated 4 years ago