iampawanpoojary / gcp_professional_data_engineer_notes
☆11Updated 4 years ago
Related projects: ⓘ
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆20Updated 5 years ago
- ☆27Updated 10 months ago
- ☆29Updated last year
- This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.☆27Updated last month
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆83Updated last year
- Code for "Advanced data transformations in SQL" free live workshop☆54Updated last month
- Code for my "Efficient Data Processing in SQL" book.☆47Updated last month
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆22Updated last year
- ☆32Updated 9 months ago
- Processing TfL data for bike usage with Google Cloud Platform.☆39Updated 2 years ago
- Analysis of SQL Leetcode and classic interview questions. Common pitfalls, anti-patterns and handy tricks are discussed. Sample databases…☆44Updated 3 years ago
- ☆84Updated 2 years ago
- Simple ETL pipeline using Python☆20Updated last year
- Final Project of the MLOps Zoomcamp hosted by DataTalksClub.☆26Updated last year
- Here I will be exploring various tools and methods that are used in data engineering process with Python.☆22Updated 3 years ago
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆20Updated 2 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆47Updated 3 months ago
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated 9 months ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆98Updated 3 years ago
- Duke MIDS: Data Engineering and DataOps Course☆55Updated last year
- Analysis of 311 Service Requests for the City of NYC (from 2010 to 2023) Tech: Prefect cloud, dbt core, BigQuery, Compute Engine, CloudRu…☆20Updated last year
- ☆35Updated last year
- This repo is meant to make it really easy to analyze the interplays between health and social media use.☆41Updated 2 years ago
- With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset wh…☆12Updated 2 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆39Updated 5 years ago
- Sample project to demonstrate data engineering best practices☆156Updated 6 months ago
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆91Updated last month
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆28Updated 5 months ago
- ☆23Updated last year
- (Python, PySpark)☆11Updated 3 years ago