fredrikhgrelland / data-meshLinks
A cloud native data mesh implementation
☆12Updated 4 years ago
Alternatives and similar repositories for data-mesh
Users that are interested in data-mesh are comparing it to the libraries listed below
Sorting:
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆145Updated last year
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).☆120Updated 3 months ago
- Apache DataLab (incubating)☆153Updated 2 years ago
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis. It enables anyone inside an or…☆94Updated 3 years ago
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.☆267Updated 9 months ago
- Helpers & syntactic sugar for PySpark.☆62Updated 3 weeks ago
- Data pipelines from re-usable components☆107Updated last month
- python automatic data quality check toolkit☆281Updated 5 years ago
- Data Tools Subjective List☆88Updated 2 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 4 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Updated 3 years ago
- Match schema attributes of relational databases by value similarity. As a study assignment, this isn't well documented, but you can conta…☆24Updated 6 years ago
- Read Delta tables without any Spark☆47Updated last year
- big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.☆65Updated 5 years ago
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆45Updated this week
- Set of iPython and Jupyter extensions to improve user experience☆50Updated 6 years ago
- A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino☆93Updated 3 weeks ago
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- An extension for Jupyter Lab & Jupyter Notebook to monitor Apache Spark (pyspark) from notebooks☆55Updated 6 months ago
- Apache (Py)Spark type annotations (stub files).☆118Updated 3 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆39Updated 5 years ago
- Distribution transparent Machine Learning experiments on Apache Spark☆91Updated last year
- A tool and library for easily deploying applications on Apache YARN☆145Updated last year
- A simple tool for plotting Spark ML's Decision Trees☆40Updated 3 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 7 years ago
- Asynchronous actions for PySpark☆48Updated 4 years ago
- Tool to automate data quality checks on data pipelines☆256Updated 3 years ago
- ☆107Updated 3 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆61Updated 2 years ago
- DataQuality for BigData☆145Updated 2 years ago