pracdata / duckdb-pipelineLinks
Demonstrating the capabilities of DuckDB as a transformation engine for data lakes
☆28Updated 8 months ago
Alternatives and similar repositories for duckdb-pipeline
Users that are interested in duckdb-pipeline are comparing it to the libraries listed below
Sorting:
- Repo for orienting dbt users to the Dagster asset framework☆54Updated 2 years ago
- Yet Another (Spark) ETL Framework☆21Updated last year
- ☆50Updated last month
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆117Updated 2 months ago
- Unity Catalog UI☆40Updated 9 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆52Updated 7 months ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆152Updated last week
- ☆18Updated 10 months ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆28Updated 10 months ago
- ☆25Updated 2 months ago
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆6Updated this week
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆57Updated last week
- ☆12Updated last month
- Utility functions for dbt projects running on Spark☆34Updated 4 months ago
- SQLMesh example projects☆29Updated 7 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset☆43Updated 7 months ago
- ☆132Updated last month
- ☆80Updated 8 months ago
- OpsCenter for Snowflake makes it easy to understand and manage your Snowflake consumption☆25Updated last year
- Example files used in the DuckDB - Unity Catalog blog☆10Updated 6 months ago
- Data-aware orchestration with dagster, dbt, and airbyte☆31Updated 2 years ago
- Delta Lake Documentation☆49Updated last year
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆25Updated last year
- Open Data Stack Platform: a collection of projects and pipelines built with open data stack tools for scalable, observable data platform…☆17Updated last month
- Nicely modeled data built on the Github Archive.☆66Updated last year
- A template for dockerized dbt-Core projects with VS Code Dev Containers.☆21Updated 2 years ago
- ☆37Updated 3 months ago
- learning-by-doing data model built with dbt-core☆13Updated 6 months ago
- A Table format agnostic data sharing framework☆38Updated last year
- Pythonic Iceberg REST Catalog☆1Updated last week