fredrikhgrelland / data-mesh
A cloud native data mesh implementation
☆12 · Updated 3 years ago
Related projects:
- Dremio Flight connector. Access Dremio using Arrow Flight ☆40 · Updated 3 years ago
- A parser for SQL that returns identifiers and a hierarchical model for lineage tracking ☆20 · Updated 6 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark ☆53 · Updated last year
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster ☆45 · Updated this week
- Accelerator to rapidly deploy customized features for your business ☆55 · Updated 9 months ago
- Ibis analytics, with Ibis (and more!) ☆19 · Updated this week
- Materials for Apache Arrow workshop at VLDB 2019 ☆42 · Updated 4 years ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients ☆36 · Updated 3 years ago
- Tools for faster and optimized interaction with Teradata and large datasets. ☆17 · Updated 6 years ago
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver. ☆22 · Updated last year
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 · Updated 9 months ago
- This repository is no longer maintained. ☆15 · Updated 2 years ago
- Data Catalog for Databases and Data Warehouses ☆31 · Updated 8 months ago
- Build your feature store with macros right within your dbt repository ☆37 · Updated last year
- A dbt adapter for Decodable ☆11 · Updated 8 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆49 · Updated 2 weeks ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆28 · Updated 2 weeks ago
- Helpers & syntactic sugar for PySpark. ☆60 · Updated last year
- Dask integration for Snowflake ☆29 · Updated 2 months ago
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou… ☆9 · Updated last year
- Dockerized setup for testing code on realistic Hadoop clusters ☆27 · Updated 4 years ago
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Au… ☆41 · Updated 3 years ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources ☆16 · Updated 3 years ago
- Pylint plugin for static code analysis on Airflow code ☆89 · Updated 3 years ago
- Code examples for the Introduction to Kubeflow course ☆13 · Updated 3 years ago
- Documentation and resources for deploying JupyterHub on Hadoop ☆18 · Updated 5 years ago