angelddaz / de-challenges
Project based learning for Data Engineering fundamentals.
☆13Updated 4 years ago
Alternatives and similar repositories for de-challenges:
Users that are interested in de-challenges are comparing it to the libraries listed below
- A repo to track data engineering projects☆13Updated 2 years ago
- Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database☆13Updated 3 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆27Updated 2 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆27Updated 2 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆24Updated 2 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆16Updated 6 years ago
- Big Data Demystified meetup and blog examples☆31Updated 8 months ago
- pyspark dataframe made easy☆16Updated 3 years ago
- This repo will guide you step-by-step method to create star schema dimensional model.☆25Updated 3 years ago
- Snowflake Cookbook, published by Packt☆79Updated 2 years ago
- Challenge Data Engineer☆25Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 8 months ago
- Udacity Data Streaming Nanodegree Program☆22Updated 4 years ago
- Challenge for those applying to the Software Engineer, Big Data position☆35Updated 13 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆11Updated 11 months ago
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit☆19Updated 2 years ago
- Python ETL demo for Hackforge☆31Updated last year
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆28Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆52Updated 4 years ago
- Data-aware orchestration with dagster, dbt, and airbyte☆31Updated 2 years ago
- Example of an ETL Pipeline using Airflow☆34Updated 7 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆13Updated last year
- Example repo to create end to end tests for data pipeline.☆23Updated 10 months ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆82Updated last year
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆14Updated last year
- Simple ETL pipeline using Python☆26Updated last year
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆54Updated last year
- (project & tutorial) dag pipeline tests + ci/cd setup☆87Updated 4 years ago
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago