danhphan / trusted-data-pipeline
Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb
☆18Updated last year
Related projects ⓘ
Alternatives and complementary repositories for trusted-data-pipeline
- The Modern Data Stack in a (Smaller) Box☆12Updated last year
- A write-audit-publish implementation on a data lake without the JVM☆41Updated 2 months ago
- Data-aware orchestration with dagster, dbt, and airbyte☆30Updated last year
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆55Updated last year
- A serverless duckDB deployment at GCP☆35Updated 2 years ago
- DuckDB Power Query Custom Connector by MotherDuck☆45Updated 3 weeks ago
- Cost Efficient Data Pipelines with DuckDB☆46Updated 3 months ago
- dagster scikit-learn pipeline example.☆43Updated last year
- Evaluation Matrix for Change Data Capture☆24Updated 3 months ago
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B…☆14Updated this week
- Demo repository to lambda-fy your dbt runs☆11Updated last year
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S…☆12Updated 3 months ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated 8 months ago
- Repo for orienting dbt users to the Dagster asset framework☆48Updated 2 years ago
- Dask integration for Snowflake☆30Updated 4 months ago
- Build your feature store with macros right within your dbt repository☆37Updated last year
- DuckDB Docker image☆24Updated 3 weeks ago
- Department of Education (DOE) for New South Wales (AUS) data stack in a box☆24Updated this week
- Fake Pandas / PySpark DataFrame creator☆43Updated 7 months ago
- Nicely modeled data built on the Github Archive.☆56Updated 8 months ago
- A cool simple example of functional data engineering☆33Updated last year
- A curated list of dagster code snippets for data engineers☆50Updated 8 months ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆37Updated this week
- Delta reader for the Ray open-source toolkit for building ML applications☆42Updated 9 months ago
- Getting started with DuckDB, by Packt Publishing☆41Updated 3 months ago
- The Modern Data Stack in a Python package☆49Updated 11 months ago
- ☆29Updated 10 months ago
- Azure extension for DuckDB☆50Updated last week
- Utility functions for dbt projects running on Spark☆31Updated last year