data-engineering-helpers / ks-cheat-sheets
Knowledge sharing - Cheat sheets
☆16 · Updated 2 weeks ago
Alternatives and similar repositories for ks-cheat-sheets
Users who are interested in ks-cheat-sheets are comparing it to the repositories listed below.
- Architecture principles ☆13 · Updated 4 months ago
- ☆156 · Updated 4 months ago
- End-to-end data engineering project to get insights from PyPI using Python, DuckDB, MotherDuck & Evidence ☆224 · Updated 2 months ago
- Demonstrating the capabilities of DuckDB as a transformation engine for data lakes ☆30 · Updated 11 months ago
- Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from… ☆34 · Updated 8 months ago
- ☆59 · Updated 4 months ago
- Data-aware orchestration with Dagster, dbt, and Airbyte ☆30 · Updated 2 years ago
- Unity Catalog UI ☆43 · Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, SQLMesh + dbt-core, DuckDB and Superset ☆53 · Updated 10 months ago
- A write-audit-publish implementation on a data lake without the JVM ☆46 · Updated last year
- Fake Pandas / PySpark DataFrame creator ☆48 · Updated last year
- An example repository showing how to leverage Kafka to stream your data ☆21 · Updated last year
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆205 · Updated this week
- Example Dagster Cloud code for the Hooli Data Engineering organization. ☆12 · Updated last week
- Edit your data contract in the Data Contract Editor ☆25 · Updated 11 months ago
- Python package for querying Iceberg data through DuckDB. ☆70 · Updated last year
- Data product portal created by Dataminded ☆190 · Updated last week
- The shared semantic layer definitions that dbt-core and MetricFlow use. ☆85 · Updated this week
- Sample configuration to deploy a modern data platform. ☆88 · Updated 3 years ago
- ☆43 · Updated last year
- A DataOps framework for building a lakehouse. ☆53 · Updated this week
- A Python package to help Databricks Unity Catalog users read and query Delta Lake tables with Polars, DuckDB, or PyArrow. ☆26 · Updated last year
- Utility functions for dbt projects running on Spark ☆33 · Updated 7 months ago
- PySpark data-pipeline testing and CI/CD ☆28 · Updated 4 years ago
- A curated list of awesome blogs, videos, tools and resources about Data Contracts ☆180 · Updated last year
- JumpSpark - A modern cookiecutter template for PySpark projects with batteries included. ☆10 · Updated 2 years ago
- SQLMesh example projects ☆35 · Updated 2 months ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle… ☆115 · Updated 5 months ago
- A framework to manage data, continuously ☆32 · Updated 8 months ago
- New-generation open-source data stack ☆73 · Updated 3 years ago