edgBR / delta-lake-polars
Building a poor man's data lake: Exploring the Power of Polars and Delta Lake
☆11 Updated 3 months ago
Alternatives and similar repositories for delta-lake-polars
Users interested in delta-lake-polars are comparing it to the libraries listed below.
- ☆29 Updated last year
- Cost Efficient Data Pipelines with DuckDB ☆58 Updated 5 months ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle… ☆119 Updated 7 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset ☆54 Updated 3 weeks ago
- Personal project for setting up an open source data warehouse. ☆31 Updated 3 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset ☆251 Updated last month
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆225 Updated last week
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence ☆226 Updated 3 weeks ago
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev ☆36 Updated 5 months ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever… ☆270 Updated last month
- Data Agents are intelligent assistants built by data engineers to help non-data professionals navigate the organization’s data infrastruc… ☆14 Updated 6 months ago
- Data product portal created by Dataminded ☆193 Updated last week
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow. ☆26 Updated last year
- ☆16 Updated last year
- Delta Lake helper methods. No Spark dependency. ☆23 Updated last year
- Python project template for Snowpark development ☆79 Updated 2 years ago
- ☆15 Updated last year
- Azure extension for DuckDB ☆67 Updated last month
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆50 Updated 2 years ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline ☆31 Updated last year
- Step-by-step tutorial on building a Kimball dimensional model with dbt ☆153 Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆74 Updated 3 weeks ago
- Dagster University courses ☆114 Updated 2 weeks ago
- ☆160 Updated 5 months ago
- Demo DAGs that show how to run dbt Core in Airflow using Cosmos ☆64 Updated 5 months ago
- Department of Education (DOE) for New South Wales (AUS) data stack in a box ☆36 Updated 11 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset ☆46 Updated 11 months ago
- ☆20 Updated last year
- Git Repo for EDW Best Practice Assets on the Lakehouse ☆15 Updated last year
- Python wrapper for the Sling CLI tool ☆58 Updated last week