sonhmai / data-system-design
System Design, Solution Architecture, Data Systems Practice
☆25Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for data-system-design
- Code for "Efficient Data Processing in Spark" Course☆239Updated last month
- Sample project to demonstrate data engineering best practices☆164Updated 8 months ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆237Updated 4 months ago
- This repository helps teach people how to correctly define and create cumulative tables!☆421Updated 2 weeks ago
- Practical Data Engineering: A Hands-On Real-Estate Project Guide☆534Updated 2 months ago
- This repository goes over how to handle massive variety in data engineering☆93Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆62Updated last month
- ☆21Updated 7 months ago
- Sample repo for startdataengineering DE 101 free course☆35Updated 4 months ago
- Code for dbt tutorial☆143Updated 5 months ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆422Updated 3 weeks ago
- Template for Data Engineering and Data Pipeline projects☆104Updated last year
- ☆190Updated 2 weeks ago
- Simple stream processing pipeline☆91Updated 4 months ago
- Step by step instructions to create a production-ready data pipeline☆25Updated last month
- This is a template you can use for your next data engineering portfolio project.☆160Updated 3 years ago
- This repo has all the resources you need to become an amazing analytics engineer!☆80Updated 7 months ago
- This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.☆38Updated 3 months ago
- Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake☆100Updated 2 months ago
- ☆89Updated 2 years ago
- Code for "Advanced data transformations in SQL" free live workshop☆65Updated 3 weeks ago
- Data Engineering examples for Airflow, Prefect, and Mage.ai; dbt for BigQuery, Redshift, ClickHouse, PostgreSQL; Spark/PySpark for Batch …☆50Updated this week
- Data pipeline with dbt, Airflow, Great Expectations☆158Updated 3 years ago
- Quickstart for any service☆129Updated this week
- A repository of sample code to accompany our blog post on Airflow and dbt.☆167Updated last year
- ☆444Updated last month
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset☆180Updated this week
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆110Updated 3 months ago
- Repo for saving cheat sheets☆42Updated 5 months ago