shankarlohar / dbt-snowflake-data-pipelineLinks
π A structured data pipeline project using dbt and Snowflake to transform raw data into curated datasets. This project covers data ingestion, cleansing, enrichment, Slowly Changing Dimensions (SCD Type 2), and analytical modeling to derive business insights.
β10Updated 4 months ago
Alternatives and similar repositories for dbt-snowflake-data-pipeline
Users that are interested in dbt-snowflake-data-pipeline are comparing it to the libraries listed below
Sorting:
- End-To-End Data Engineering Project. Made to learn some common data engineering practices.β13Updated 2 months ago
- Code for "Efficient Data Processing in Spark" Courseβ327Updated 2 months ago
- Quickstart for any serviceβ158Updated this week
- β205Updated 6 months ago
- Slow & local data allows you to move fast and deliver business value for the 99.9% of the data challenges.β273Updated 4 months ago
- In this repository we store all materials for dlt workshops, courses, etc.β215Updated this week
- Transaction processing & vis pipeline using PySpark Streamingβ30Updated last year
- A demonstration of an ELT (Extract, Load, Transform) pipelineβ30Updated last year
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principleβ¦β115Updated 4 months ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testinβ¦β73Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Supersetβ237Updated 5 months ago
- Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflakeβ199Updated last month
- Practical Data Engineering: A Hands-On Real-Estate Project Guideβ678Updated 11 months ago
- A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.β245Updated last year
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testingβ271Updated last year
- A list of publicly available datasets with real-time data maintained by the team at bytewax.ioβ922Updated 5 months ago
- Sample project to demonstrate data engineering best practicesβ195Updated last year
- Example projects built on MotherDuckβ29Updated last month
- β15Updated 4 months ago
- Demo Project for Open Source MDSβ168Updated 2 months ago
- A repository defining a simple data pipleine for ETL jobs relating to media metadata.β23Updated 5 months ago
- Code for my "Efficient Data Processing in SQL" book.β57Updated last year
- Personal project for setting up an open source data warehouse.β31Updated last month
- Template for Data Engineering and Data Pipeline projectsβ114Updated 2 years ago
- Welcome to my GitHub repository. I hope you enjoy solving these puzzles as much as I have enjoyed creating them.β757Updated 5 months ago
- Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.β88Updated last year
- Streaming analytics project with eventsim and Kafkaβ12Updated 2 years ago
- A curated list of awesome public DBT projectsβ144Updated last year
- A handpicked collection of resources for Python developers in data engineering, machine learning, and AI. Inside, you'll discover a neatlβ¦β106Updated last year
- My own ETL pipeline of random users utilising Postgres for long term storage and Redis for caching. Served up via FastAPI and Dockerβ25Updated 9 months ago