fredrikhgrelland / data-mesh
A cloud native data mesh implementation
☆12 Updated 4 years ago
Alternatives and similar repositories for data-mesh
Users that are interested in data-mesh are comparing it to the libraries listed below
- Apache Liminal's goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful … ☆144 Updated last year
- Helpers & syntactic sugar for PySpark. ☆62 Updated 2 years ago
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes. ☆267 Updated 5 months ago
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster ☆45 Updated this week
- Python binding for DataFusion ☆59 Updated 3 years ago
- Comparisons of big data technologies for cleaning, manipulating, and generally wrangling data for analysis and machine learning. ☆65 Updated 5 years ago
- Dask integration for Snowflake ☆30 Updated last month
- Deploy dask on YARN clusters ☆69 Updated last year
- This repository is no longer maintained. ☆15 Updated 3 years ago
- Apache DataLab (incubating) ☆152 Updated last year
- A JupyterLab extension providing SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino ☆89 Updated last week
- Asynchronous actions for PySpark ☆47 Updated 3 years ago
- Python - Java/Scala API for the Hopsworks feature store ☆54 Updated last month
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 Updated last year
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations). ☆120 Updated this week
- Arrow, pydantic style ☆84 Updated 2 years ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex) ☆141 Updated 2 years ago
- Data Tools Subjective List ☆87 Updated 2 years ago
- Data pipelines from re-usable components ☆107 Updated 2 years ago
- A tool and library for easily deploying applications on Apache YARN ☆144 Updated last year
- Read Delta tables without any Spark ☆47 Updated last year
- This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A… ☆49 Updated 2 years ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources ☆17 Updated 4 years ago
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver. ☆25 Updated 2 years ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆55 Updated 2 months ago
- Apache (Py)Spark type annotations (stub files). ☆117 Updated 3 years ago
- Build your feature store with macros right within your dbt repository ☆39 Updated 2 years ago
- real-time data + ML pipeline ☆54 Updated last week
- A parser for SQL, which gives back identifiers and a hierarchical model for lineage tracking ☆20 Updated 7 years ago
- Pylint plugin for static code analysis on Airflow code ☆96 Updated 4 years ago