ognis1205 / delta-hub
A platform and cloud-based service for data sharing based on the Delta Sharing protocol.
☆21 Updated 7 months ago
Alternatives and similar repositories for delta-hub:
Users interested in delta-hub are comparing it to the libraries listed below.
- Unity Catalog UI ☆39 Updated 4 months ago
- A Table format agnostic data sharing framework ☆38 Updated 11 months ago
- Delta lake and filesystem helper methods ☆50 Updated 11 months ago
- Yet Another (Spark) ETL Framework ☆18 Updated last year
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆192 Updated this week
- A Python Library to support running data quality rules while the spark job is running⚡ ☆168 Updated last week
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆76 Updated 4 months ago
- ☆27 Updated 6 months ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆43 Updated last year
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow. ☆23 Updated 10 months ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included. ☆10 Updated last year
- Delta Acceptance Testing ☆20 Updated 6 months ago
- Library to convert DBT manifest metadata to Airflow tasks ☆47 Updated 10 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset ☆48 Updated 2 months ago
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows ☆41 Updated 6 months ago
- ✨ A Pydantic to PySpark schema library ☆65 Updated this week
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆63 Updated 4 months ago
- Fake Pandas / PySpark DataFrame creator ☆44 Updated 10 months ago
- csv and flat-file sniffer built in Rust. ☆42 Updated last year
- ☆15 Updated 6 months ago
- Edit your data contract in the Data Contract Editor ☆15 Updated 3 months ago
- Data product portal created by Dataminded ☆172 Updated this week
- 🏁 A sweet and speedy code generator for dbt 🏎️✨ ☆25 Updated 7 months ago
- Pythonic Iceberg REST Catalog ☆72 Updated 4 months ago
- Pytest plugin for dbt core ☆58 Updated 2 weeks ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ☆125 Updated 2 weeks ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ☆73 Updated last year
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples ☆106 Updated 2 weeks ago
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu… ☆57 Updated last year
- ☆26 Updated last month