KennethanCeyer / awesome-data-pipelineLinks
Awesome list for datapipeline
☆34Updated 2 years ago
Alternatives and similar repositories for awesome-data-pipeline
Users that are interested in awesome-data-pipeline are comparing it to the libraries listed below
Sorting:
- Apache Spark Guide☆31Updated 3 years ago
- Spark data pipeline that processes movie ratings data.☆28Updated last week
- Full stack data engineering tools and infrastructure set-up☆53Updated 4 years ago
- ☆12Updated 3 years ago
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for …☆137Updated 5 years ago
- Awesome list of dataops products, open source and resources☆24Updated 3 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆33Updated 3 years ago
- dlt-dagster-demo☆11Updated last year
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆58Updated last year
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆14Updated 2 years ago
- A curated list of awesome DataOps tools☆190Updated 8 months ago
- a curated list of awesome lakehouse frameworks, applications, etc☆32Updated 4 months ago
- Some example projects for Data Engineers to build, end-to-end.☆30Updated last year
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆84Updated 2 years ago
- Data Tools Subjective List☆83Updated last year
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆35Updated last year
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆28Updated 2 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆77Updated 2 years ago
- New generation opensource data stack☆69Updated 3 years ago
- A curated list of dagster code snippets for data engineers☆55Updated last year
- Resources for video demonstrations and blog posts related to DataOps on AWS☆178Updated 3 years ago
- Simple stream processing pipeline☆102Updated last year
- Awesome List for Data Operations☆24Updated 4 years ago
- ☆18Updated 10 months ago
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work☆47Updated 2 years ago
- This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data to Bigquery. Ther…☆24Updated this week
- This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spar…☆35Updated 2 years ago
- ☆41Updated 11 months ago
- Code snippets for Data Engineering Design Patterns book☆119Updated 3 months ago
- Curated list of resources about Apache Airflow☆19Updated 4 years ago