adidas / m3d-api
Metadata Driven Development (m3d) is a cloud- and platform-agnostic framework for the automated creation, management and governance of data lakes.
☆31 Updated 2 years ago
Alternatives and similar repositories for m3d-api
Users that are interested in m3d-api are comparing it to the libraries listed below
- M3D Engine is a Spark application for the development of scalable data transformations and ingestions in data lakes. ☆18 Updated 4 years ago
- ☆95 Updated 2 years ago
- A Scalable Data Cleaning Library for PySpark. ☆29 Updated 6 years ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data. ☆79 Updated last month
- The sane way of building a data layer in Airflow ☆24 Updated 5 years ago
- Fivetran data models for QuickBooks using dbt. ☆34 Updated this week
- Deployment tools/scripts for Metaflow! ☆56 Updated 2 years ago
- A curated list of awesome Databricks resources, including Spark ☆20 Updated last year
- 🔍 Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎 ☆16 Updated 2 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆59 Updated this week
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ☆57 Updated 3 years ago
- Pipeline library for StreamSets Data Collector and Transformer ☆33 Updated 2 years ago
- Awesome list of dataops products, open source and resources ☆24 Updated 3 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... ☆75 Updated this week
- Egeria's Guidance on Governance as well as large media files such as presentations and movies ☆104 Updated 2 years ago
- A python client library for the Stitch Import API ☆42 Updated last year
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes. ☆24 Updated 6 years ago
- Ansible roles to deploy Kubernetes, JupyterHub, Jupyter Enterprise Gateway and Spark on Kubernetes cluster ☆38 Updated 4 years ago
- This project is created to promote and advocate the use of FOSS machine learning. ☆46 Updated 2 months ago
- FADI - Ingest, store and analyse big data flows ☆46 Updated last year
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆126 Updated 3 years ago
- ☆11 Updated 5 years ago
- 💻 CLI for reporting events to Faros platform ☆14 Updated 2 months ago
- New generation opensource data stack ☆70 Updated 3 years ago
- Repository for building CDAP and additional external projects ☆16 Updated this week
- ODD Specification is a universal open standard for collecting metadata. ☆142 Updated 8 months ago
- bamboolib - template for creating your own binder notebook ☆21 Updated 3 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi ☆65 Updated last year
- Full stack data engineering tools and infrastructure set-up ☆53 Updated 4 years ago
- dbt data models for facebook ads ☆40 Updated 7 months ago