caitpj / SQL-WatchPup
A collection of Python tools for SQL data management, with a strong focus on simplicity, flexibility, and speed.
☆19 Updated 8 months ago
Alternatives and similar repositories for SQL-WatchPup
Users who are interested in SQL-WatchPup are comparing it to the libraries listed below.
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines. ☆171 Updated this week
- A DuckDB-powered command line interface for Snowflake security, governance, operations, and cost optimization. ☆41 Updated last year
- Make dbt great again! Extend dbt with plugins, local docs and custom adapters — fast, safe, and developer-friendly ☆259 Updated last week
- Data Product Portal created by Dataminded ☆195 Updated this week
- The Picnic Data Vault framework. ☆130 Updated last year
- ☆40 Updated 7 months ago
- Companion repository for the "Streamlining AWS Glue CI/CD — A Comprehensive Blueprint" blog post ☆12 Updated last year
- Quickstart for any service ☆167 Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data. ☆259 Updated last year
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle… ☆121 Updated 7 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset ☆254 Updated last month
- A Table format agnostic data sharing framework ☆42 Updated last year
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it ☆76 Updated 6 months ago
- A write-audit-publish implementation on a data lake without the JVM ☆45 Updated last year
- ☆80 Updated last year
- ☆157 Updated last week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆222 Updated this week
- dbt (data build tool) adapter for the Dremio ☆52 Updated 3 weeks ago
- Python package for querying iceberg data through duckdb. ☆70 Updated last year
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples ☆123 Updated 9 months ago
- New generation opensource data stack ☆75 Updated 3 years ago
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence ☆226 Updated last month
- Library to convert DBT manifest metadata to Airflow tasks ☆49 Updated last year
- Streaming demo dbt ☆17 Updated last year
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever… ☆274 Updated last month
- Iceberg Playground in a Box ☆67 Updated 4 months ago
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool. ☆71 Updated this week
- Delta Lake helper methods in PySpark ☆324 Updated last year
- Witboost is a versatile platform that addresses a wide range of sophisticated data engineering challenges. The Starter Kit showcases the … ☆25 Updated 3 weeks ago
- Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including… ☆168 Updated last week