rsyi / whale
🐳 The stupidly simple CLI workspace for your data warehouse.
★728 · Updated 2 years ago
Alternatives and similar repositories for whale
Users who are interested in whale are comparing it to the libraries listed below.
- re_data - fix data issues before your users & CEO would discover them ★1,561 · Updated last year
- Notebook sharing hub ★500 · Updated last year
- This repository is a getting started guide to Singer. ★1,303 · Updated 9 months ago
- Write python locally, execute SQL in your data warehouse ★269 · Updated 2 years ago
- Writes the Singer format from Python ★563 · Updated 2 months ago
- Guides and docs to help you get up and running with Apache Airflow. ★806 · Updated 2 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ★126 · Updated 3 years ago
- Data Pipeline Framework using the singer.io spec ★648 · Updated this week
- python automatic data quality check toolkit ★283 · Updated 4 years ago
- The goal of pandas-log is to provide feedback about basic pandas operations. It provides simple wrapper functions for the most common fun… ★216 · Updated 3 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ★168 · Updated last year
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ★2,082 · Updated 2 months ago
- Quilt is a data mesh for connecting people with actionable data ★1,342 · Updated last week
- Repository for the ActivitySchema spec and supporting materials ★419 · Updated 2 years ago
- Tool to automate data quality checks on data pipelines ★255 · Updated 2 years ago
- Apache Airflow integration for dbt ★404 · Updated last year
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io ★2,099 · Updated this week
- Assets related to the operation of Fishtown Analytics. ★419 · Updated 7 months ago
- The metrics layer for your data. Join us at https://metriql.com/slack ★307 · Updated 2 years ago
- ★199 · Updated last year
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark ★1,511 · Updated 6 months ago
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton ★861 · Updated last year
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more! ★574 · Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data. ★254 · Updated last year
- MetricFlow allows you to define, build, and maintain metrics in code. ★1,216 · Updated this week
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow ★2,080 · Updated last year
- Macros that generate dbt code ★566 · Updated 2 months ago
- Monitor the stability of a Pandas or Spark dataframe ★501 · Updated 4 months ago
- Data ingestion library for Amundsen to build graph and search index ★205 · Updated last year
- dbt + Metabase integration ★524 · Updated 2 weeks ago