iakovoskritikos / Data-Engineering
This is an all-in-one repository for Data Engineers, ideal for beginners & interview preparation, which includes Python as the main Programing language incorporating MySQL, MongoDB and Docker
β27Updated 2 years ago
Alternatives and similar repositories for Data-Engineering:
Users that are interested in Data-Engineering are comparing it to the libraries listed below
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMRβ82Updated 5 years ago
- πComplete End to End ETL Pipeline with Spark, Airflow, & AWSβ45Updated 5 years ago
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data froβ¦β21Updated last year
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviewsβ115Updated 10 months ago
- β50Updated last year
- Projects done in the Data Engineer Nanodegree Program by Udacity.comβ149Updated 2 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflowβ142Updated 4 years ago
- β27Updated last year
- YouTube tutorial projectβ102Updated last year
- This repo is mostly created for pyspark and hive related interview questions.β47Updated 3 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modelingβ100Updated 4 years ago
- I'm partaking in a Data Engineering Bootcamp / Zoomcamp. I'll store files and progress here.β103Updated 2 years ago
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.β130Updated 7 months ago
- Udacity Data Engineering Nanodegree Capstone Projectβ36Updated 4 years ago
- β14Updated 2 years ago
- This is a template you can use for your next data engineering portfolio project.β175Updated 3 years ago
- This data project can be used as a take-home assignment to learn Pyspark and Data Engineering.β14Updated 2 years ago
- Sample project to demonstrate data engineering best practicesβ184Updated last year
- Data Engineering with AWS, 2nd edition - Published by Packtβ136Updated last year
- This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.β64Updated 7 months ago
- Resources for the free AWS Data Engineering course on youtubeβ99Updated 3 years ago
- Data Engineering with Google Cloud Platform, published by Packtβ114Updated last year
- This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science tβ¦β101Updated 2 months ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme