danielbeach / polars-DeltaLake
Trying out the DataFrame Polars library with Delta Lake ... feat. Python.
☆11 · Updated 5 months ago
Alternatives and similar repositories for polars-DeltaLake
Users that are interested in polars-DeltaLake are comparing it to the libraries listed below
- End-to-end data engineering project to get insights from PyPI using Python, DuckDB, MotherDuck & Evidence ☆211 · Updated 2 weeks ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆50 · Updated last year
- CSV and flat-file sniffer built in Rust. ☆42 · Updated last year
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDB, or PyArrow. ☆25 · Updated last year
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle… ☆115 · Updated 3 months ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ☆86 · Updated 2 years ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline ☆29 · Updated last year
- A "modern" Strava data pipeline fueled by dlt, DuckDB, dbt, and evidence.dev ☆34 · Updated 2 months ago
- Code for dbt tutorial ☆156 · Updated last month
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/ ☆38 · Updated last year
- Template for Data Engineering and Data Pipeline projects ☆112 · Updated 2 years ago
- ☆57 · Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆70 · Updated 9 months ago
- Code for my "Efficient Data Processing in SQL" book. ☆57 · Updated 11 months ago
- ☆80 · Updated 9 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset ☆52 · Updated 8 months ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆218 · Updated 2 months ago
- Run, mock and test fake Snowflake databases locally. ☆144 · Updated this week
- ☆52 · Updated 2 years ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt ☆143 · Updated last year
- Data pipeline with dbt, Airflow, Great Expectations ☆163 · Updated 4 years ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin… ☆73 · Updated last year
- Repo for orienting dbt users to the Dagster asset framework ☆54 · Updated 2 years ago
- fst: flow state tool | smooth where you want it, friction where you need it when data engineering ☆34 · Updated 2 years ago
- Make dbt great again! Enables end users to extend dbt to their needs ☆87 · Updated 2 weeks ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆195 · Updated last week
- Demo Project for Open Source MDS ☆168 · Updated last month
- Sample project that uses Dagster, dbt, DuckDB and Dash to visualize the Spanish car and motorcycle market ☆58 · Updated 2 years ago
- Write your dbt models using Ibis ☆68 · Updated 4 months ago
- How to unit test your PySpark code ☆29 · Updated 4 years ago