unitycatalog / unitycatalog-pythonLinks
☆18Updated last year
Alternatives and similar repositories for unitycatalog-python
Users that are interested in unitycatalog-python are comparing it to the libraries listed below
Sorting:
- Unity Catalog UI☆43Updated last year
- ☆13Updated 2 years ago
- A Python Library to support running data quality rules while the spark job is running⚡☆188Updated this week
- Delta Lake examples☆227Updated 11 months ago
- Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!☆233Updated 8 months ago
- REST API for Apache Spark on K8S or YARN☆104Updated 3 weeks ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆246Updated 2 weeks ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆81Updated 5 months ago
- A Table format agnostic data sharing framework☆39Updated last year
- Adapter for dbt that executes dbt pipelines on Apache Flink☆95Updated last year
- a curated list of awesome lakehouse frameworks, applications, etc☆35Updated 7 months ago
- A flake8 plugin that detects of usage withColumn in a loop or inside reduce☆28Updated 3 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆219Updated last month
- Open, Multi-modal Catalog for Data & AI, written in Rust☆82Updated 11 months ago
- Library to convert DBT manifest metadata to Airflow tasks☆49Updated last year
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!☆38Updated this week
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Updated last year
- Proof-of-concept extension combining the delta extension with Unity Catalog☆89Updated 2 months ago
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆413Updated 4 months ago
- ☆267Updated 11 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated 2 weeks ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆139Updated last month
- Apache Hive Metastore as a Standalone server in Docker☆80Updated last year
- Delta Lake Documentation☆50Updated last year
- Yet Another (Spark) ETL Framework☆21Updated last year
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆100Updated 2 years ago
- Drop-in replacement for Apache Spark UI☆311Updated this week
- A library that provides useful extensions to Apache Spark and PySpark.☆229Updated 2 months ago
- The Internals of Spark on Kubernetes☆71Updated 3 years ago
- The shared semantic layer definitions that dbt-core and MetricFlow use.☆85Updated this week