Data engineering interviews Q&A for data community by data community
☆66 Jun 7, 2020 Updated 5 years ago
Alternatives and similar repositories for data-engineering-interviews
Users who are interested in data-engineering-interviews are comparing it to the repositories listed below.
- ☆10 Jun 29, 2023 Updated 2 years ago
- Extract, Transform, Load (ETL) refers to a process in database usage and especially in data warehousing. This repository contains a s… ☆21 Mar 20, 2017 Updated 8 years ago
- Apache Spark Interview Question and Answers ☆21 Oct 13, 2020 Updated 5 years ago
- Simple command line application to read/write message to kafka topic using protobuf ☆14 Mar 27, 2023 Updated 2 years ago
- Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3 ☆32 Feb 2, 2021 Updated 5 years ago
- An ETL pipeline that extracts data from S3, stages them in Redshift, and transforms data into a set of dimensional tables ☆15 May 5, 2020 Updated 5 years ago
- More than 2000+ Data engineer interview questions. ☆1,524 Jan 13, 2026 Updated last month
- Delta Lake Examples ☆11 Apr 24, 2020 Updated 5 years ago
- The open source version of the Amazon Redshift Getting Started Guide. ☆15 Jun 15, 2023 Updated 2 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS ☆51 Aug 23, 2019 Updated 6 years ago
- Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution ☆67 Aug 5, 2020 Updated 5 years ago
- Template for Data Engineering and Data Pipeline projects ☆116 Jan 1, 2023 Updated 3 years ago
- Different study cases of handling and analysis data, using python tools. ☆16 Dec 10, 2021 Updated 4 years ago
- Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database ☆14 Oct 26, 2021 Updated 4 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores. ☆20 Oct 11, 2021 Updated 4 years ago
- pyspark dataframe made easy ☆16 Dec 15, 2021 Updated 4 years ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution ☆19 Aug 16, 2020 Updated 5 years ago
- sql-for-data-engineering-course ☆18 May 12, 2023 Updated 2 years ago
- Example orchestration pipeline for Fivetran + dbt managed by Airflow ☆22 Feb 18, 2021 Updated 5 years ago
- Read SAS files in JavaScript. Because you always wanted to do that, right? ☆29 Dec 4, 2019 Updated 6 years ago
- This repository is part of an article "Prefect workflow automation with Azure DevOps and AKS" ☆31 Feb 12, 2021 Updated 5 years ago
- ☆29 Jul 29, 2023 Updated 2 years ago
- A curated collection of publicly available resources on dbt best practices and how data-driven organizations around the world utilize dbt ☆115 Feb 28, 2022 Updated 4 years ago
- Various machine learning approaches are widely applied for short-term solar power forecasting, which is highly demanded for renewable ene… ☆13 Feb 18, 2020 Updated 6 years ago
- Statistical modeling lies at the heart of data science. Well crafted statistical models allow data scientists to draw conclusions about t… ☆11 Jan 21, 2026 Updated last month
- Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR ☆11 Jul 26, 2023 Updated 2 years ago
- Course materials for UMBC DATA 690 - Statistical Analysis and Data Visualization with Python. ☆12 Dec 5, 2024 Updated last year
- ☆11 Dec 17, 2025 Updated 2 months ago
- A framework to manage data, continuously ☆33 Jan 20, 2025 Updated last year
- Python ETL demo for Hackforge ☆32 Oct 11, 2023 Updated 2 years ago
- Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme… ☆1,826 Aug 26, 2022 Updated 3 years ago
- Repository for the Honor Track of Recommender Systems Specialization from University of Minnesota on Coursera ☆37 Aug 25, 2019 Updated 6 years ago
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for … ☆141 Apr 18, 2020 Updated 5 years ago
- My Graduate Capstone Project - This is a Product Recommendation System for a Local Wholesaler in India, using Python and Machine Learning… ☆27 Mar 20, 2021 Updated 4 years ago
- a set of scripts to pull meta data and data profiling metrics from relational database systems ☆77 Apr 17, 2024 Updated last year
- In this Case Study I'm performing Exploratory Analysis & Building a model which will Classify if Patient has CHD or Not. ☆14 Jul 31, 2019 Updated 6 years ago
- Covid19 Dashboard India ☆12 Feb 27, 2021 Updated 5 years ago
- A Python library for reading the YXDB file format ☆11 Jul 11, 2024 Updated last year
- You can encode and decode base85, ascii85, base64, base32, and base16 with this tool. ☆11 Oct 4, 2023 Updated 2 years ago