jason-jz-zhu / databathing
☆22Updated last month
Alternatives and similar repositories for databathing:
Users that are interested in databathing are comparing it to the libraries listed below
- A Minimalistic Rust Implementation of Delta Sharing Server.☆89Updated 3 weeks ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- Yet Another (Spark) ETL Framework☆20Updated last year
- Data Catalog for Databases and Data Warehouses☆33Updated last year
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆24Updated last year
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB☆33Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated 2 weeks ago
- Unity Catalog UI☆40Updated 7 months ago
- Python binding for DataFusion☆59Updated 2 years ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- Amundsen Gremlin☆21Updated 2 years ago
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.☆87Updated this week
- The Internals of PySpark☆26Updated 3 months ago
- A Table format agnostic data sharing framework☆38Updated last year
- ☆11Updated 2 years ago
- Read Delta tables without any Spark☆47Updated last year
- A library that brings useful functions from various modern database management systems to Apache Spark☆58Updated last year
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆49Updated this week
- DB API 2 interface for Flight SQL with SQLAlchemy extras.☆38Updated 2 weeks ago
- Simple Workflow Framework - Hamilton + APScheduler = FlowerPower☆16Updated last week
- ☆32Updated last year
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Updated 10 months ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆78Updated 6 months ago
- Utility functions for dbt projects running on Spark☆32Updated 2 months ago
- Inspect Your Servers with DuckDB☆30Updated 2 years ago
- A purely experimental DuckDB Deltalake extension☆95Updated this week
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆62Updated 2 years ago
- SQL query executor on remote DuckDB instance using Apache Arrow Flight RPC through Streamlit Web interface.☆12Updated 5 months ago
- ☆13Updated last week
- Transporter for integrating OpenLineage with OpenMetadata☆12Updated last year