dbt-labs / dbt-coreLinks
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
☆12,224Updated this week
Alternatives and similar repositories for dbt-core
Users that are interested in dbt-core are comparing it to the libraries listed below
Sorting:
- An orchestration platform for the development, production, and observation of data assets.☆14,930Updated this week
- Always know what to expect from your data.☆11,133Updated this week
- A modular SQL linter and auto-formatter with support for multiple dialects and templated code.☆9,474Updated this week
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆2,891Updated this week
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr…☆2,341Updated this week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,735Updated last week
- Utility functions for dbt projects.☆1,687Updated 3 weeks ago
- data load tool (dlt) is an open source Python library that makes data loading easy 🛠️☆4,903Updated this week
- The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lak…☆20,668Updated this week
- Data Contracts engine for the modern data stack. https://www.soda.io☆2,281Updated last week
- Compare tables within or across databases☆2,991Updated last year
- Self-serve BI to 10x your data team ⚡️☆5,535Updated this week
- Business intelligence as code: build fast, interactive data visualizations in SQL and markdown☆5,870Updated last month
- Python SQL Parser and Transpiler☆8,881Updated last week
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-host…☆2,242Updated last week
- An Open Standard for lineage metadata collection☆2,304Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆21,577Updated this week
- OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata rep…☆8,674Updated this week
- Curated list of resources about Apache Airflow☆3,885Updated 2 weeks ago
- re_data - fix data issues before your users & CEO would discover them 😊☆1,569Updated last year
- This repository is a getting started guide to Singer.☆1,326Updated 6 months ago
- A curated list of awesome ETL frameworks, libraries, and software.☆3,515Updated last year
- Collect, aggregate, and visualize a data ecosystem's metadata☆2,119Updated this week
- 🧙 Build, run, and manage data pipelines for integrating and transforming data.☆8,645Updated this week
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,580Updated last week
- MetricFlow allows you to define, build, and maintain metrics in code.☆1,468Updated last week
- Construct Apache Airflow DAGs Declaratively via YAML configuration files☆1,415Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆16,486Updated this week
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,575Updated this week
- the portable Python dataframe library☆6,385Updated this week