Stefen-Taime / modern-data-pipeline
reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.
☆13Updated last year
Alternatives and similar repositories for modern-data-pipeline:
Users that are interested in modern-data-pipeline are comparing it to the libraries listed below
- build dw with dbt☆33Updated 2 months ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆26Updated 2 years ago
- ☆18Updated 5 months ago
- Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand☆45Updated last year
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆95Updated 5 months ago
- Example repo to create end to end tests for data pipeline.☆21Updated 7 months ago
- ☆40Updated 6 months ago
- Code for my "Efficient Data Processing in SQL" book.☆54Updated 5 months ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Updated last year
- A custom end-to-end analytics platform for customer churn☆10Updated 3 weeks ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆25Updated 2 years ago
- Some example projects for Data Engineers to build, end-to-end.☆27Updated last year
- Code for "Advanced data transformations in SQL" free live workshop☆71Updated 2 months ago
- ☆61Updated last week
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆10Updated last year
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆47Updated 3 years ago
- Cloned by the `dbt init` task☆60Updated 8 months ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆21Updated last year
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆30Updated 10 months ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆70Updated last year
- Build & Learn Data Engineering,Machine Learning over Kubernetes. No Shortcut approach.☆58Updated 2 years ago
- Repo for CDC with debezium blog post☆28Updated 4 months ago
- Cost Efficient Data Pipelines with DuckDB☆48Updated 5 months ago
- I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29Updated last year
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆17Updated 8 months ago
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers☆24Updated 2 years ago
- Data pipeline that scrapes Rust cheater Steam profiles☆51Updated 2 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆26Updated 2 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆22Updated 2 years ago