edgBR / delta-lake-polarsLinks
Building a poor man's data lake: Exploring the Power of Polars and Delta Lake
☆11Updated last month
Alternatives and similar repositories for delta-lake-polars
Users that are interested in delta-lake-polars are comparing it to the libraries listed below
Sorting:
- ☆30Updated last year
- Cost Efficient Data Pipelines with DuckDB☆61Updated 8 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆55Updated 3 months ago
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆27Updated last year
- Personal project for setting up an open source data warehouse.☆31Updated 6 months ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆124Updated 10 months ago
- A DataOps framework for building a lakehouse.☆56Updated last month
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence☆233Updated last month
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆279Updated 3 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆258Updated last month
- Delta Lake helper methods. No Spark dependency.☆22Updated last week
- Fabric Python Notebooks examples☆102Updated last week
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆179Updated 3 weeks ago
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows☆45Updated last week
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆162Updated last year
- Delta Lake examples☆237Updated last year
- Department of Education (DOE) for New South Wales (AUS) data stack in a box☆36Updated last year
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Updated last year
- Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including…☆176Updated 2 weeks ago
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team …☆131Updated this week
- This repository serves as a comprehensive guide to effective data modeling and robust data quality assurance using popular open-source to…☆37Updated 2 years ago
- Python project template for Snowpark development☆80Updated 2 years ago
- Azure extension for DuckDB☆71Updated this week
- ☆392Updated this week
- Data Product Portal created by Dataminded☆197Updated this week
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.☆125Updated last year
- Possibly the fastest DataFrame-agnostic quality check library in town.☆234Updated 3 months ago
- Example files used in the DuckDB - Unity Catalog blog☆10Updated last year
- DBT Package reproducing dbt incremental materialization leveraging on Snowflake streams☆34Updated last month
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Updated 2 years ago