iakovoskritikos / Data-Engineering
An all-in-one repository for data engineers, ideal for beginners and interview preparation. It uses Python as the main programming language and incorporates MySQL, MongoDB, and Docker.
☆26 Updated last year
Alternatives and similar repositories for Data-Engineering:
Users that are interested in Data-Engineering are comparing it to the libraries listed below
- Complete End to End ETL Pipeline with Spark, Airflow, & AWS ☆43 Updated 5 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com ☆135 Updated 2 years ago
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs. ☆123 Updated 6 months ago
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr… ☆10 Updated last year
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR ☆80 Updated 5 years ago
- In this project, we will build an ETL (Extract, Transform, Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro… ☆21 Updated last year
- ☆44 Updated last year
- ☆135 Updated 2 years ago
- Azure Data Factory ☆57 Updated this week
- YouTube tutorial project ☆99 Updated last year
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews ☆109 Updated 9 months ago
- This repo contains all the code used in the Python for Data Engineering Course ☆256 Updated 10 months ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling ☆101 Updated 4 years ago
- Udacity Data Engineering Nanodegree Capstone Project ☆35 Updated 4 years ago
- This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science t… ☆79 Updated last month
- ☆19 Updated last year
- Recohut - Learn data engineering, data science ☆96 Updated last year
- ☆14 Updated 2 years ago
- ☆41 Updated 7 months ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow ☆139 Updated 4 years ago
- ☆149 Updated 2 years ago
- Simple ETL pipeline using Python ☆25 Updated last year
- Git Repository ☆137 Updated 3 weeks ago
- I'm partaking in a Data Engineering Bootcamp / Zoomcamp. I'll store files and progress here. ☆102 Updated 2 years ago
- Contains spark dataframe solutions of leetcode questions ☆24 Updated 2 years ago
- Classwork projects and homework done through the Udacity Data Engineering Nanodegree ☆74 Updated last year
- ☆27 Updated 2 years ago
- Data Engineering YouTube Analysis Project by Darshil Parmar ☆180 Updated last year
- Resources and projects from Udacity Data Engineering with AWS nano degree programme ☆25 Updated last year
- apache-spark-with-databricks-for-data-engineering ☆73 Updated 8 months ago