danielbeach / lakescum
A Python package that helps Databricks Unity Catalog users read and query Delta Lake tables with Polars, DuckDB, or PyArrow.
☆27 · Updated last year
Alternatives and similar repositories for lakescum
Users who are interested in lakescum are comparing it to the libraries listed below.
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆225 · Updated 9 months ago
- Data-aware orchestration with dagster, dbt, and airbyte ☆31 · Updated 3 years ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset ☆55 · Updated 3 months ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle… ☆124 · Updated 9 months ago
- ☆176 · Updated 8 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆51 · Updated 2 years ago
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions. ☆125 · Updated last year
- Pytest plugin for dbt core ☆63 · Updated last year
- Data Product Portal created by Dataminded ☆197 · Updated last week
- Cost Efficient Data Pipelines with DuckDB ☆61 · Updated 8 months ago
- Schema modelling framework for decentralised domain-driven ownership of data. ☆261 · Updated 2 years ago
- Write your dbt models using Ibis ☆74 · Updated 10 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆223 · Updated last month
- 🏁 A sweet and speedy code generator for dbt 🏎️✨ ☆32 · Updated last week
- Proof-of-concept extension combining the delta extension with Unity Catalog ☆95 · Updated 2 weeks ago
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples ☆126 · Updated 11 months ago
- Parse dbt artifacts and search dbt models with Algolia ☆52 · Updated 4 years ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆234 · Updated 3 months ago
- ☆158 · Updated last week
- ☆80 · Updated last year
- A DataOps framework for building a lakehouse. ☆56 · Updated last month
- Package to assert rows in-line with dbt macros. ☆69 · Updated 2 months ago
- A Python Library to support running data quality rules while the spark job is running⚡ ☆194 · Updated this week
- ✨ A Pydantic to PySpark schema library ☆118 · Updated this week
- A curated list of awesome blogs, videos, tools and resources about Data Contracts ☆181 · Updated last year
- The shared semantic layer definitions that dbt-core and MetricFlow use. ☆91 · Updated last week
- csv and flat-file sniffer built in Rust. ☆44 · Updated 2 years ago
- Utility functions for dbt projects running on Spark ☆34 · Updated last month
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence ☆233 · Updated last month
- Repo for orienting dbt users to the Dagster asset framework ☆56 · Updated 3 years ago