fredrikhgrelland / data-meshLinks
A cloud native data mesh implementation
☆12Updated 5 years ago
Alternatives and similar repositories for data-mesh
Users that are interested in data-mesh are comparing it to the libraries listed below
Sorting:
- Apache DataLab (incubating)☆152Updated 2 years ago
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆144Updated last year
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis. It enables anyone inside an or…☆94Updated 3 years ago
- Helpers & syntactic sugar for PySpark.☆62Updated 2 months ago
- A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino☆93Updated this week
- Data pipelines from re-usable components☆107Updated 2 months ago
- Data Tools Subjective List☆89Updated 2 years ago
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆25Updated 2 years ago
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.☆268Updated 10 months ago
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).☆120Updated 4 months ago
- Data Catalog for Databases and Data Warehouses☆36Updated 2 years ago
- Read Delta tables without any Spark☆47Updated last year
- Python - Java/Scala API for the Hopsworks feature store☆55Updated 4 months ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 7 years ago
- A tool and library for easily deploying applications on Apache YARN☆146Updated last year
- Ibis analytics, with Ibis (and more!)☆24Updated last year
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆127Updated 4 years ago
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆107Updated 3 years ago
- ☆108Updated 3 years ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆160Updated 3 years ago
- The sane way of building a data layer in Airflow☆24Updated 6 years ago
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆46Updated last week
- [ARCHIVED] The Presto adapter plugin for dbt Core☆32Updated 2 years ago
- Build your feature store with macros right within your dbt repository☆39Updated 3 years ago
- big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.☆65Updated 5 years ago
- An extension for Jupyter Lab & Jupyter Notebook to monitor Apache Spark (pyspark) from notebooks☆55Updated 2 weeks ago
- A simple tool for plotting Spark ML's Decision Trees☆40Updated 4 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Updated 3 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆39Updated 5 years ago
- ☆22Updated 3 weeks ago