sspaeti / data-engineer-handbookLinks
This is a repo with links to everything you'd ever want to learn about data engineering
☆10Updated 6 months ago
Alternatives and similar repositories for data-engineer-handbook
Users that are interested in data-engineer-handbook are comparing it to the libraries listed below
Sorting:
- Full stack data engineering tools and infrastructure set-up☆53Updated 4 years ago
- Analytics Engineering best practices and standards used at Hiflylabs☆12Updated 2 weeks ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆28Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB☆53Updated 3 weeks ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 10 months ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆11Updated last year
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆83Updated last year
- csv and flat-file sniffer built in Rust.☆42Updated last year
- dbt plugin for Palm CLI☆21Updated last year
- Data models for Hubspot built using dbt.☆38Updated last month
- A dbt-Core package for generating models from an activity stream.☆42Updated last year
- Example projects built on MotherDuck☆28Updated this week
- Data-diff solution for dbt-ers with Snowflake ❄️ 🚀☆18Updated 2 weeks ago
- ☆18Updated 9 months ago
- Automate and streamline the alerting & notification process for dbt test results🐞🚀☆17Updated last month
- Contribute to dlt verified sources 🔥☆85Updated this week
- ☆37Updated 2 months ago
- Code for data quality with greatexpectations blog☆12Updated 10 months ago
- 🏁 A sweet and speedy code generator for dbt 🏎️✨☆27Updated 11 months ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆35Updated last year
- duckdb-etl-framework☆11Updated 5 months ago
- Utility functions for dbt projects running on Spark☆34Updated 3 months ago
- A configuration-driven framework for building Dagster pipelines that enables teams to create and manage data workflows using YAML/JSON in…☆31Updated 6 months ago
- A new Airflow Provider for Fivetran, maintained by Astronomer and Fivetran☆22Updated last week
- Code to demonstrate data engineering metadata & logging best practices☆16Updated last year
- Repository for Data Engineering Interview Series☆31Updated 7 months ago
- A minimum viable setup for dbt with environment variables.☆16Updated 6 years ago
- Repo for CDC with debezium blog post☆28Updated 8 months ago
- F1 Data Pipeline☆23Updated last year
- ☆80Updated 7 months ago