data-engineering-helpers / ks-cheat-sheets
Knowledge sharing - Cheat sheets
☆14, updated this week
Alternatives and similar repositories for ks-cheat-sheets
Users who are interested in ks-cheat-sheets are comparing it to the repositories listed below.
- Architecture principles ☆13, updated 2 months ago
- ☆150, updated 2 months ago
- Data-aware orchestration with dagster, dbt, and airbyte ☆30, updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset ☆52, updated 9 months ago
- Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from… ☆34, updated 6 months ago
- SQLMesh example projects ☆33, updated last month
- End-to-end data engineering project to get insights from PyPI using Python, DuckDB, MotherDuck & Evidence ☆215, updated last month
- Unity Catalog UI ☆42, updated 11 months ago
- A curated list of dagster code snippets for data engineers ☆56, updated last year
- ☆41, updated last year
- Fake Pandas / PySpark DataFrame creator ☆47, updated last year
- Learning-by-doing data model built with dbt-core ☆14, updated 8 months ago
- Contribute to dlt verified sources 🔥 ☆90, updated this week
- Cost Efficient Data Pipelines with DuckDB ☆56, updated 2 months ago
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit… ☆60, updated 2 weeks ago
- A write-audit-publish implementation on a data lake without the JVM ☆46, updated 11 months ago
- A Python package to help Databricks Unity Catalog users read and query Delta Lake tables with Polars, DuckDB, or PyArrow. ☆26, updated last year
- Databricks dbt factory library for creating Databricks Job definitions where individual dbt models are run as separate tasks. ☆18, updated 3 weeks ago
- Python package for querying Iceberg data through DuckDB. ☆70, updated last year
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included. ☆10, updated 2 years ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle… ☆115, updated 4 months ago
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team … ☆124, updated 2 weeks ago
- A DataOps framework for building a lakehouse. ☆52, updated this week
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu… ☆61, updated 2 years ago
- Dagster SQLMesh Adapter ☆64, updated last week
- Sample code to collect Apache Iceberg metrics for table monitoring ☆28, updated 11 months ago
- All things awesome related to Dagster! ☆122, updated last month
- A bunch of hacks developed around dbt ☆48, updated 5 years ago
- Utility functions for dbt projects running on Spark ☆33, updated 6 months ago
- A framework to manage data, continuously ☆32, updated 6 months ago