debussy-labs / debussy_concert
Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and pipelines.
☆28 Updated 2 years ago
Alternatives and similar repositories for debussy_concert
Users who are interested in debussy_concert are comparing it to the repositories listed below
- Repo with scripts and automation to help ensure best practices in Google Data Catalog ☆13 Updated 3 years ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service ☆71 Updated last year
- Sample code with integration between Data Catalog and Hive data source. ☆24 Updated 5 months ago
- A repository of sample code to show data quality checking best practices using Airflow. ☆77 Updated 2 years ago
- Makes it simple to store test results and visualise them in a BI dashboard ☆45 Updated last week
- ☆22 Updated 4 years ago
- Sample configuration to deploy a modern data platform. ☆88 Updated 3 years ago
- A Python package to centralize some Google Cloud Data Catalog scripts, this repo contains commands like bulk CSV operations that help lev… ☆22 Updated 2 years ago
- A write-audit-publish implementation on a data lake without the JVM ☆46 Updated 11 months ago
- Contains example dags and terraform code to create a composer with a node pool to run pods ☆13 Updated 4 years ago
- End-to-end DataOps platform deployed by Terraform. ☆67 Updated 3 months ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆64 Updated 3 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆29 Updated 3 weeks ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started! ☆60 Updated 2 years ago
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level. ☆56 Updated this week
- Data Catalog Tag Templates ☆30 Updated 2 months ago
- BigQuery Schema Conversion Tool ☆23 Updated 4 years ago
- ☆46 Updated last year
- Tag Engine automates the process of creating, updating, deleting, and populating metadata in bulk with the Google Cloud services Data Cat… ☆56 Updated last week
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database ☆75 Updated 3 years ago
- Sample code with integration between Data Catalog and BI data sources. ☆32 Updated 3 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html ☆61 Updated 2 years ago
- Great Expectations Airflow operator ☆167 Updated this week
- A dbt (data build tool) project you can use for testing purposes or experimentation ☆37 Updated last year
- Big Data Demystified meetup and blog examples ☆31 Updated 11 months ago
- Parse dbt artifacts and search dbt models with Algolia ☆52 Updated 4 years ago
- Historical metadata of your data warehouse is a treasure trove to discover not just insights about changing data patterns, but also quali… ☆13 Updated 3 years ago
- Read Delta tables without any Spark ☆47 Updated last year
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ☆154 Updated this week
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... ☆75 Updated this week