data-engineering-helpers / ks-cheat-sheetsLinks
Knowledge sharing - Cheat sheets
☆18Updated 2 weeks ago
Alternatives and similar repositories for ks-cheat-sheets
Users that are interested in ks-cheat-sheets are comparing it to the libraries listed below
Sorting:
- Architecture principles☆13Updated 7 months ago
- Data-aware orchestration with dagster, dbt, and airbyte☆31Updated 2 years ago
- ☆169Updated 7 months ago
- Demonstrating the capabilities of DuckDB as a transformation engine for data lakes☆32Updated last year
- A write-audit-publish implementation on a data lake without the JVM☆45Updated last year
- Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from…☆36Updated 11 months ago
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆20Updated 2 months ago
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit…☆68Updated 3 weeks ago
- ☆62Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆55Updated 2 months ago
- A curated list of awesome blogs, videos, tools and resources about Data Contracts☆181Updated last year
- Dagster SQLMesh Adapter☆77Updated 2 months ago
- Contribute to dlt verified sources 🔥☆102Updated last month
- Yet Another (Spark) ETL Framework☆21Updated 2 years ago
- Manage Unity Catalog tables with Pydantic Models☆10Updated 10 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆233Updated 2 months ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆29Updated last year
- A DataOps framework for building a lakehouse.☆55Updated 3 weeks ago
- Fake Pandas / PySpark DataFrame creator☆48Updated last year
- An example repository showing how to leverage Kafka to stream your data☆21Updated last year
- The shared semantic layer definitions that dbt-core and MetricFlow use.☆90Updated last month
- ☆151Updated last month
- A Python Library to support running data quality rules while the spark job is running⚡☆193Updated 2 weeks ago
- Cost Efficient Data Pipelines with DuckDB☆60Updated 7 months ago
- Write your dbt models using Ibis☆74Updated 9 months ago
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence☆230Updated last month
- A curated list of dagster code snippets for data engineers☆56Updated last year
- All things awesome related to Dagster!☆139Updated this week
- Unity Catalog UI☆43Updated last year
- ☆376Updated this week