caitpj / SQL-WatchPup
A collection of Python tools for SQL data management, with a strong focus on simplicity, flexibility, and speed.
☆16Updated 3 months ago
Alternatives and similar repositories for SQL-WatchPup
Users who are interested in SQL-WatchPup are comparing it to the libraries listed below.
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆117Updated 2 months ago
- ☆80Updated 8 months ago
- A lightweight Python-based tool for extracting and analyzing data column lineage for dbt projects☆168Updated 3 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆232Updated 4 months ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆70Updated 9 months ago
- PDF DataSource for Apache Spark; allows reading PDF files directly into a DataFrame and running OCR on them☆69Updated 2 months ago
- Airbyte made simple (no UI, no database, no cluster)☆175Updated 3 weeks ago
- A DuckDB-powered command line interface for Snowflake security, governance, operations, and cost optimization.☆40Updated 10 months ago
- End-to-end data engineering project to get insights from PyPI using Python, DuckDB, MotherDuck & Evidence☆208Updated last month
- Showcase of advanced use cases relating to CI in dbt☆81Updated this week
- Make dbt docs and Apache Superset talk to one another☆146Updated 5 months ago
- ☆150Updated last week
- A DataOps framework for building a lakehouse.☆50Updated this week
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆142Updated 11 months ago
- ☆132Updated last month
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.☆120Updated this week
- A write-audit-publish implementation on a data lake without the JVM☆46Updated 10 months ago
- Dagster University courses☆90Updated this week
- Library to convert dbt manifest metadata to Airflow tasks☆48Updated last year
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆217Updated last week
- Repo for CDC with debezium blog post☆28Updated 9 months ago
- Companion repository for the "Streamlining AWS Glue CI/CD — A Comprehensive Blueprint" blog post☆13Updated 7 months ago
- Make dbt great again! Enables end users to extend dbt to their needs☆80Updated last week
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆7Updated this week
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- Quickstart for any service☆155Updated this week
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Package to assert rows in-line with dbt macros.☆68Updated 2 months ago
- A table-format-agnostic data sharing framework☆38Updated last year
- A repository of sample code to accompany our blog post on Airflow and dbt.☆174Updated last year