danielbeach / lakescum
A Python package that helps Databricks Unity Catalog users read and query Delta Lake tables with Polars, DuckDB, or PyArrow.
☆23 Updated 11 months ago
Alternatives and similar repositories for lakescum:
Users who are interested in lakescum are comparing it to the libraries listed below.
- Pytest plugin for dbt core ☆59 Updated 2 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆49 Updated last year
- Make dbt great again! Enables end users to extend dbt to fit their needs ☆56 Updated last week
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions. ☆115 Updated last month
- Utility functions for dbt projects running on Spark ☆31 Updated last month
- ☆74 Updated 4 months ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included. ☆10 Updated last year
- ☆23 Updated 3 weeks ago
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples ☆116 Updated last month
- A framework to manage data, continuously ☆32 Updated last month
- A dbt package for easily using production data in a development environment. ☆40 Updated 3 weeks ago
- Delta Lake Documentation ☆49 Updated 8 months ago
- A dbt-Core package for generating models from an activity stream. ☆39 Updated 11 months ago
- Package of macros for dbt to make it easier to protect your customers' data ☆44 Updated 2 years ago
- Parse dbt artifacts and search dbt models with Algolia ☆52 Updated 3 years ago
- Makes it simple to store test results and visualise them in a BI dashboard ☆42 Updated last month
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆206 Updated last week
- Fake Pandas / PySpark DataFrame creator ☆46 Updated last year
- Data-aware orchestration with dagster, dbt, and airbyte ☆31 Updated 2 years ago
- Repo contains the materializations for Data Engineers DataOps Framework ☆30 Updated last month
- Unity Catalog UI ☆39 Updated 6 months ago
- ☆51 Updated 2 years ago
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu… ☆59 Updated last year
- The shared semantic layer definitions that dbt-core and MetricFlow use. ☆76 Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset ☆49 Updated 4 months ago
- A write-audit-publish implementation on a data lake without the JVM ☆46 Updated 7 months ago
- Macros for generating dbt model data profiles ☆85 Updated 3 months ago
- CSV and flat-file sniffer built in Rust. ☆42 Updated last year
- titan: a package manager for Snowflake DB ☆23 Updated 2 years ago
- dbt starter code for enterprise Snowflake usage data artifacts ☆22 Updated 2 years ago