debussy-labs / debussy_concert
Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and pipelines.
☆28 · Updated 2 years ago
Alternatives and similar repositories for debussy_concert:
Users who are interested in debussy_concert are comparing it to the libraries listed below:
- Makes it simple to store test results and visualise them in a BI dashboard ☆44 · Updated last month
- A repository of sample code to show data quality checking best practices using Airflow. ☆77 · Updated 2 years ago
- Event-triggered plugins for Airflow ☆21 · Updated 5 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆63 · Updated 2 years ago
- Composable filesystem hooks and operators for Apache Airflow. ☆17 · Updated 3 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 · Updated last year
- Sample configuration to deploy a modern data platform. ☆88 · Updated 3 years ago
- The go-to demo for public and private dbt Learn ☆77 · Updated last month
- dbt's adapter for Dremio ☆48 · Updated 2 years ago
- Build your feature store with macros right within your dbt repository ☆38 · Updated 2 years ago
- 🐋 Docker image for AWS Glue Spark/Python ☆23 · Updated last year
- dbt-github-workflow is a boilerplate that contains all the necessary configurations to set up a simple CI/CD pipeline for your data model… ☆17 · Updated 3 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆29 · Updated this week
- Enforce Best Practices for all your Airflow DAGs. ⭐ ☆99 · Updated last week
- Delta reader for the Ray open-source toolkit for building ML applications ☆45 · Updated last year
- Package of macros for dbt to make it easier to protect your customers' data ☆46 · Updated 2 years ago
- Data validation library for PySpark 3.0.0 ☆33 · Updated 2 years ago
- Repo with scripts and automation to help ensure best practices in Google Data Catalog ☆13 · Updated 3 years ago
- Weekly Data Engineering Newsletter ☆94 · Updated 9 months ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆36 · Updated 2 months ago
- A provider package for DuckDB ☆16 · Updated last year
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them ☆26 · Updated last year
- Evaluation Matrix for Change Data Capture ☆25 · Updated 9 months ago
- A dbt-Core package for generating models from an activity stream. ☆41 · Updated last year
- Run dbt serverless in the Cloud (AWS) ☆42 · Updated 5 years ago
- Repo for orienting dbt users to the Dagster asset framework ☆54 · Updated 2 years ago
- Utility functions for dbt projects running on Spark ☆33 · Updated 2 months ago
- Pytest plugin for dbt core ☆60 · Updated 3 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆53 · Updated 8 months ago
- A PySpark library to validate data quality ☆18 · Updated 2 years ago