caitpj / SQL-WatchPupLinks
A collection of Python tools for SQL data management, with a strong focus on simplicity, flexibility, and speed.
☆19Updated 7 months ago
Alternatives and similar repositories for SQL-WatchPup
Users that are interested in SQL-WatchPup are comparing it to the libraries listed below
Sorting:
- Data product portal created by Dataminded☆192Updated this week
- A write-audit-publish implementation on a data lake without the JVM☆46Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆247Updated 2 weeks ago
- A DuckDB-powered command line interface for Snowflake security, governance, operations, and cost optimization.☆41Updated last year
- The Open-Source Enterprise Data Platform in a single Portal☆258Updated this week
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆223Updated 5 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆218Updated 2 weeks ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆259Updated last year
- Quickstart for any service☆165Updated this week
- Make dbt great again! Enables end user to extend dbt to his/her needs☆200Updated this week
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.☆156Updated this week
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆69Updated 2 weeks ago
- Python package for querying iceberg data through duckdb.☆70Updated last year
- ☆81Updated 8 months ago
- DBT Package reproducing dbt incremental materialization leveraging on Snowflake streams☆33Updated last month
- ☆39Updated 6 months ago
- ☆157Updated 3 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated 2 years ago
- Enables Python developers to leverage Debezium's CDC capabilities with custom event handlers and seamless integration.☆35Updated last month
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆75Updated 5 months ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆117Updated 6 months ago
- Code snippets for Data Engineering Design Patterns book☆232Updated 7 months ago
- Delta Lake helper methods in PySpark☆323Updated last year
- A curated list of awesome blogs, videos, tools and resources about Data Contracts☆180Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆74Updated this week
- A Table format agnostic data sharing framework☆40Updated last year
- 🏃♀️ Minimalist SQL orchestrator☆289Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆54Updated last week
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆223Updated this week