caitpj / SQL-WatchPupLinks
A collection of Python tools for SQL data management, with a strong focus on simplicity, flexibility, and speed.
☆21Updated 9 months ago
Alternatives and similar repositories for SQL-WatchPup
Users that are interested in SQL-WatchPup are comparing it to the libraries listed below
Sorting:
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.☆177Updated this week
- Data Product Portal created by Dataminded☆197Updated last week
- A write-audit-publish implementation on a data lake without the JVM☆45Updated last year
- ☆80Updated last year
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆123Updated 9 months ago
- Make simple storing test results and visualisation of these in a BI dashboard☆51Updated 3 weeks ago
- Make dbt great again! Extend dbt with plugins, local docs and custom adapters — fast, safe, and developer-friendly☆273Updated 3 weeks ago
- Companion repository for the "Streamlining AWS Glue CI/CD — A Comprehensive Blueprint" blog post☆11Updated last year
- ☆158Updated last month
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆222Updated last month
- A flake8 plugin that detects of usage withColumn in a loop or inside reduce☆28Updated 6 months ago
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆20Updated 2 months ago
- A DuckDB-powered command line interface for Snowflake security, governance, operations, and cost optimization.☆41Updated last year
- A Table format agnostic data sharing framework☆42Updated last year
- Pytest plugin for dbt core☆63Updated 11 months ago
- Library to convert DBT manifest metadata to Airflow tasks☆49Updated 2 weeks ago
- A repository of sample code to accompany our blog post on Airflow and dbt.☆183Updated 2 years ago
- Package to assert rows in-line with dbt macros.☆69Updated last month
- Possibly the fastest DataFrame-agnostic quality check library in town.☆233Updated 2 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆193Updated last week
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆257Updated 3 weeks ago
- New generation opensource data stack☆76Updated 3 years ago
- Showcase of advanced use cases relating to CI in dbt☆95Updated 2 weeks ago
- A guide for leading a data (engineering) team☆63Updated last year
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆72Updated 3 weeks ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆55Updated 2 months ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆260Updated 2 years ago
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆78Updated 8 months ago
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆74Updated 2 years ago
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.☆125Updated 11 months ago