shawlu95 / Data-Engineering-ToolboxLinks

Lecture notes, lab notes, and links to helpful resources to pass Google Certification Exam for Professional Data Engineer.

☆18

Alternatives and similar repositories for Data-Engineering-Toolbox

Users that are interested in Data-Engineering-Toolbox are comparing it to the libraries listed below

Sorting:

hyunjoonbok / PySpark
PySpark functions and utilities with examples. Assists ETL process of data modeling
☆104Updated 5 years ago
BenSchr / Udacity-Data-Engineering-Projects
My solutions for the Udacity Data Engineering Nanodegree
☆34Updated 6 years ago
PacktPublishing / Mastering-Big-Data-Analytics-with-PySpark
Mastering Big Data Analytics with PySpark, Published by Packt
☆165Updated last year
kaburelabs / Data-Engineering-track-with-Python
All Data Engineering notebooks from Datacamp course
☆116Updated 6 years ago
gabfr / data-engineering-nanodegree
notebooks produced throughout the Udacity's Nanodegree Data Engineering Course
☆74Updated 5 years ago
supratim94336 / DataEngineeringCapstoneProject
😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS
☆50Updated 6 years ago
chandra1sekar / data-engineering
☆31Updated 7 years ago
itversity / data-engineering-spark
☆88Updated 3 years ago
ajupton / big-data-engineering-project
Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR
☆88Updated 6 years ago
danieldiamond / data-engineering-capstone
Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development
☆21Updated 6 years ago
immu0001 / Udacity-Data-Engineer-nanodegree
Classwork projects and home works done through Udacity data engineering nano degree
☆75Updated 2 years ago
nareshk1290 / Udacity-Data-Engineering
Udacity Data Engineering Nano Degree (DEND)
☆189Updated 6 years ago
raveendratal / ravi_azureadbadf
Ravi Azure ADB ADF Repository
☆64Updated last year
AnandDedha / aws-airflow-dataengineering-pipeline
☆21Updated 2 years ago
arverma / TowardsDataEngineering
This repo contains commands that data engineers use in day to day work.
☆61Updated 3 years ago
manuel-lang / Data-Engineering-Nanodegree
Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…
☆57Updated 3 years ago
Flor91 / Data-engineering-nanodegree
Projects done in the Data Engineering Nanodegree by Udacity.com
☆273Updated 6 years ago
sankamuk / PysparkCheatsheet
PySpark Cheatsheet
☆36Updated 3 years ago
tirthajyoti / Spark-with-Python
Fundamentals of Spark with Python (using PySpark), code examples
☆362Updated 3 years ago
Wittline / uber-expenses-tracking
The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such …
☆123Updated 3 years ago
shravan-kuchkula / udacity-data-eng-proj-1
Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…
☆89Updated 4 years ago
dgadiraju / itversity-books
☆118Updated 5 years ago
ismaildawoodjee / aws-data-pipeline
A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…
☆23Updated 3 years ago
shravan-kuchkula / udacity-data-eng-proj2
A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…
☆24Updated 4 years ago
vivek-bombatkar / Spark-with-Python---My-learning-notes-
ETL pipeline using pyspark (Spark - Python)
☆116Updated 5 years ago
CICIFLY / Data_Engineering_Project_Portfolio
Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3
☆31Updated 5 years ago
damklis / etljob
Simple ETL pipeline using Python
☆29Updated 2 years ago
bobbydreamer / Udacity-Nano_Degree_Data_Engineering
My Udacity Data Engineer Nano Degree Projects aka Udacity DEND
☆16Updated 5 years ago
Joshua-omolewa / Retailstore_ETL_pipeline_project
Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr…
☆11Updated 2 years ago
JoseRFJuniorLLMs / PySpark-ETL
PySpark-ETL
☆22Updated 6 years ago