debussy-labs / debussy_concert
Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and pipelines.
☆28Updated last year
Alternatives and similar repositories for debussy_concert:
Users that are interested in debussy_concert are comparing it to the libraries listed below
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- A repository of sample code to show data quality checking best practices using Airflow.☆74Updated 2 years ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Updated last year
- Make simple storing test results and visualisation of these in a BI dashboard☆42Updated last month
- ☆43Updated 3 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- Code examples for the Introduction to Kubeflow course☆14Updated 4 years ago
- ☆20Updated 3 years ago
- Glue VSCode devcontainer setup☆14Updated 2 years ago
- Contains example dags and terraform code to create a composer with a node pool to run pods☆13Updated 4 years ago
- New generation opensource data stack☆65Updated 2 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆29Updated last year
- A dbt (data build tool) project you can use for testing purposes or experimentation☆36Updated last year
- Pylint plugin for static code analysis on Airflow code☆93Updated 4 years ago
- Great Expectations Airflow operator☆160Updated this week
- Big Data Demystified meetup and blog examples☆31Updated 7 months ago
- Creates simple data models on Snowflake to report dbt source freshness and tests☆24Updated last year
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆62Updated 2 years ago
- Skeleton project for Apache Airflow training participants to work on.☆16Updated 4 years ago
- Data Catalog Tag Templates☆30Updated 5 months ago
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data.☆41Updated 2 months ago
- dbt package for monitoring airflow DAGs and tasks☆29Updated last month
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.☆82Updated this week
- event-triggered plugins for airflow☆21Updated 5 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- Repo with scripts and automation to help ensure best practices in Google Data Catalog☆13Updated 3 years ago