danielbeach / drainageLinks
Rust + Python Lake House Health Analyzer | Detect • Diagnose • Optimize • Flow
☆67Updated 2 months ago
Alternatives and similar repositories for drainage
Users that are interested in drainage are comparing it to the libraries listed below
Sorting:
- Turning PySpark Into a Universal DataFrame API☆464Updated last week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆222Updated 2 weeks ago
- ☆360Updated last week
- Delta Lake helper methods in PySpark☆325Updated last year
- Run, mock and test fake Snowflake databases locally.☆160Updated this week
- ☆167Updated 7 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆193Updated this week
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆123Updated 10 months ago
- Dagster Labs' open-source data platform, built with Dagster.☆425Updated this week
- Data Product Portal created by Dataminded☆196Updated last week
- Python wrapper for the Sling CLI tool☆62Updated last month
- The next-generation engine for dbt☆583Updated last week
- The smallest DuckDB SQL orchestrator on Earth.☆334Updated 3 weeks ago
- Package to assert rows in-line with dbt macros.☆69Updated 3 weeks ago
- PyAirbyte brings the power of Airbyte to every Python developer.☆313Updated this week
- Possibly the fastest DataFrame-agnostic quality check library in town.☆231Updated last month
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆257Updated last week
- A lightweight Python-based tool for extracting and analyzing data column lineage for dbt projects☆193Updated 8 months ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆277Updated 2 months ago
- Apache DataFusion Python Bindings☆536Updated last week
- Delta Lake Documentation☆51Updated last year
- Read Apache Arrow batches from ODBC data sources in Python☆73Updated 2 weeks ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆225Updated 7 months ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆79Updated this week
- Delta Lake helper methods. No Spark dependency.☆23Updated last year
- Delta Lake examples☆235Updated last year
- ☆349Updated this week
- 🏃♀️ Minimalist SQL orchestrator☆295Updated this week
- This repository has moved into https://github.com/dbt-labs/dbt-adapters☆250Updated 10 months ago
- a compute manifest and tools for ML☆463Updated this week