data-engineering-helpers / ks-cheat-sheetsLinks
Knowledge sharing - Cheat sheets
☆19Updated this week
Alternatives and similar repositories for ks-cheat-sheets
Users that are interested in ks-cheat-sheets are comparing it to the libraries listed below
Sorting:
- Architecture principles☆13Updated 8 months ago
- Data-aware orchestration with dagster, dbt, and airbyte☆31Updated 3 years ago
- A write-audit-publish implementation on a data lake without the JVM☆45Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆55Updated 3 months ago
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit…☆70Updated last month
- Fake Pandas / PySpark DataFrame creator☆48Updated last year
- Contribute to dlt verified sources 🔥☆104Updated 2 months ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated 2 years ago
- ☆179Updated 8 months ago
- An example repository showing how to leverage Kafka to stream your data☆21Updated last year
- Manage Unity Catalog tables with Pydantic Models☆10Updated 11 months ago
- A framework to manage data, continuously☆32Updated last year
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆22Updated 3 weeks ago
- Unity Catalog UI☆43Updated last year
- Discover the simplicity and strength of Duckdb, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution …☆39Updated 2 years ago
- A curated list of awesome blogs, videos, tools and resources about Data Contracts☆182Updated last year
- Utility functions for dbt projects running on Spark☆34Updated last month
- A curated list of dagster code snippets for data engineers☆56Updated last year
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu…☆59Updated 2 years ago
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆27Updated last year
- Cost Efficient Data Pipelines with DuckDB☆61Updated 8 months ago
- ☆393Updated last week
- ☆19Updated last year
- Demonstrating the capabilities of DuckDB as a transformation engine for data lakes☆33Updated last year
- ☆46Updated last year
- The shared semantic layer definitions that dbt-core and MetricFlow use.☆91Updated 2 weeks ago
- A DataOps framework for building a lakehouse.☆56Updated last month
- Write your dbt models using Ibis☆75Updated 10 months ago
- A curated list of awesome DuckLake tools and resources☆77Updated 3 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆236Updated 3 months ago