danielbeach / drainage
Rust + Python Lake House Health Analyzer | Detect • Diagnose • Optimize • Flow
☆65 Updated 3 months ago
Alternatives and similar repositories for drainage
Users who are interested in drainage are comparing it to the libraries listed below.
- Turning PySpark Into a Universal DataFrame API ☆485 Updated this week
- Run, mock and test fake Snowflake databases locally. ☆169 Updated last week
- ☆179 Updated 8 months ago
- Python wrapper for the Sling CLI tool ☆63 Updated last month
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples ☆125 Updated last year
- ☆393 Updated last week
- The smallest DuckDB SQL orchestrator on Earth. ☆336 Updated 2 months ago
- 🏃‍♀️ Minimalist SQL orchestrator ☆302 Updated this week
- A Rust based data/CSV/Parquet file generator ☆63 Updated 11 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆236 Updated 3 months ago
- A compute manifest and composable tools for ML, built on Ibis, DataFusion, and Arrow Flight. ☆484 Updated this week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆226 Updated this week
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle… ☆124 Updated 10 months ago
- Dagster Labs' open-source data platform, built with Dagster. ☆435 Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset ☆258 Updated last month
- Data Product Portal created by Dataminded ☆198 Updated this week
- Delta Lake helper methods in PySpark ☆327 Updated 3 weeks ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆225 Updated 9 months ago
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files ☆272 Updated last week
- This repository has moved into https://github.com/dbt-labs/dbt-adapters ☆250 Updated last year
- Schema modelling framework for decentralised domain-driven ownership of data. ☆261 Updated 2 years ago
- Package to assert rows in-line with dbt macros. ☆69 Updated 2 months ago
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu… ☆59 Updated 2 years ago
- A Postgres Proxy Server in Python ☆314 Updated last year
- Delta Lake helper methods. No Spark dependency. ☆22 Updated 3 weeks ago
- ☆376 Updated this week
- Write your dbt models using Ibis ☆75 Updated 10 months ago
- Read Apache Arrow batches from ODBC data sources in Python ☆74 Updated 3 weeks ago
- A lightweight Python-based tool for extracting and analyzing data column lineage for dbt projects ☆194 Updated 10 months ago
- A Python Library to support running data quality rules while the spark job is running⚡ ☆197 Updated this week