zsvoboda / ngods
New generation opensource data stack
☆60Updated 2 years ago
Related projects: ⓘ
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆68Updated last year
- Delta Lake Documentation☆45Updated 3 months ago
- ☆20Updated 3 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆100Updated 2 months ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆187Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset☆163Updated last week
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆73Updated 2 weeks ago
- Code snippets for Data Engineering Design Patterns book☆27Updated this week
- Data-aware orchestration with dagster, dbt, and airbyte☆29Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆52Updated 5 months ago
- Full stack data engineering tools and infrastructure set-up☆38Updated 3 years ago
- A write-audit-publish implementation on a data lake without the JVM☆39Updated last month
- A simple and easy to use Data Quality (DQ) tool built with Python.☆45Updated last year
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆51Updated last year
- ☆132Updated this week
- Data Tools Subjective List☆80Updated last year
- A curated list of dagster code snippets for data engineers☆48Updated 6 months ago
- ☆67Updated this week
- Cloned by the `dbt init` task☆58Updated 4 months ago
- A repository of sample code to show data quality checking best practices using Airflow.☆71Updated last year
- Cost Efficient Data Pipelines with DuckDB☆42Updated last month
- Quickstart for any service☆110Updated this week
- Code for my "Efficient Data Processing in SQL" book.☆47Updated last month
- Example repository showing how to build a data platform with Prefect, dbt and Snowflake☆90Updated last year
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆122Updated 2 years ago
- ☆20Updated 3 years ago
- Python wrapper for the Sling CLI tool☆37Updated 2 months ago
- ☆60Updated last month
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆102Updated this week
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆22Updated 5 months ago