vmware / versatile-data-kit
One framework to develop, deploy and operate data workflows with Python and SQL.
☆446Updated 2 weeks ago
Alternatives and similar repositories for versatile-data-kit:
Users that are interested in versatile-data-kit are comparing it to the libraries listed below
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.☆369Updated this week
- ODD Specification is a universal open standard for collecting metadata.☆137Updated 5 months ago
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipe…☆433Updated last week
- A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineer…☆537Updated last month
- Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including…☆153Updated this week
- A curated list of awesome DataOps tools☆185Updated 6 months ago
- A dbt-core plugin to weave together multi-project dbt-core deployments☆144Updated 3 weeks ago
- New Generation Opensource Data Stack Demo☆431Updated 2 years ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆232Updated last month
- PyAirbyte brings the power of Airbyte to every Python developer.☆261Updated last week
- The metrics layer for your data. Join us at https://metriql.com/slack☆306Updated 2 years ago
- Make dbt docs and Apache Superset talk to one another☆141Updated 3 months ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆244Updated 2 months ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆252Updated last year
- Dagster Labs' open-source data platform, built with Dagster.☆344Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆225Updated 2 months ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆111Updated 3 weeks ago
- A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts☆355Updated 2 weeks ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆141Updated 3 weeks ago
- Generate the ERD as a code from dbt artifacts☆244Updated this week
- dbt (data build tool) adapter for the Dremio☆51Updated last week
- Quickstart for any service☆145Updated this week
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files☆261Updated last week
- Quick Guides from Dremio on Several topics☆70Updated 3 months ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆219Updated this week
- Turning PySpark Into a Universal DataFrame API☆387Updated this week
- A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.☆194Updated last week
- Possibly the fastest DataFrame-agnostic quality check library in town.☆187Updated this week
- Macros for calculating metrics☆220Updated 2 months ago
- Home of the Open Data Contract Standard (ODCS).☆477Updated last week