sspaeti / data-engineer-handbookLinks
This is a repo with links to everything you'd ever want to learn about data engineering
☆10Updated last year
Alternatives and similar repositories for data-engineer-handbook
Users that are interested in data-engineer-handbook are comparing it to the libraries listed below
Sorting:
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆28Updated 3 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆91Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB☆60Updated 6 months ago
- Code for dbt tutorial☆165Updated 3 months ago
- New generation opensource data stack☆76Updated 3 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset☆46Updated last year
- Code for my "Efficient Data Processing in SQL" book.☆60Updated last year
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆44Updated last year
- Template for Data Engineering and Data Pipeline projects☆115Updated 2 years ago
- Code snippets for Data Engineering Design Patterns book☆288Updated 8 months ago
- End to end data engineering project☆57Updated 3 years ago
- A curated list of awesome DataOps tools☆211Updated 5 months ago
- Weekly Data Engineering Newsletter☆97Updated last year
- Some example projects for Data Engineers to build, end-to-end.☆36Updated 2 years ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆224Updated 7 months ago
- A guide for leading a data (engineering) team☆63Updated last year
- Cloned by the `dbt init` task☆62Updated last year
- Apache Airflow Best Practices, published by Packt☆51Updated last year
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆95Updated 6 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated 2 years ago
- ☆44Updated last year
- A custom end-to-end analytics platform for customer churn☆11Updated 6 months ago
- ☆213Updated 10 months ago
- Example repo to create end to end tests for data pipeline.☆25Updated last year
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and …☆85Updated last year
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for …☆139Updated 5 years ago
- ☆39Updated 9 months ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆31Updated last year
- Local development environment for python data projects, with Docker☆23Updated 2 years ago