rsyi / whaleLinks
🐳 The stupidly simple CLI workspace for your data warehouse.
☆728Updated 2 years ago
Alternatives and similar repositories for whale
Users that are interested in whale are comparing it to the libraries listed below
Sorting:
- re_data - fix data issues before your users & CEO would discover them 😊☆1,570Updated last year
- Data Pipeline Framework using the singer.io spec☆657Updated last week
- Tool to automate data quality checks on data pipelines☆257Updated 3 years ago
- Writes the Singer format from Python☆576Updated 2 months ago
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning m…☆858Updated last year
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io☆2,271Updated this week
- Scalable identity resolution, entity resolution, data mastering and deduplication using ML☆1,132Updated 3 weeks ago
- Write python locally, execute SQL in your data warehouse☆269Updated 3 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆168Updated 2 years ago
- What's in your data? Extract schema, statistics and entities from datasets☆1,539Updated 3 months ago
- Repository for the ActivitySchema spec and supporting materials☆431Updated 3 years ago
- Generate and Visualize Data Lineage from query history☆327Updated 2 years ago
- Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to b…☆804Updated 3 years ago
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!☆624Updated last week
- ☆202Updated 2 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 4 years ago
- Fast iterative local development and testing of Apache Airflow workflows☆202Updated 2 weeks ago
- The metrics layer for your data. Join us at https://metriql.com/slack☆321Updated 2 years ago
- python automatic data quality check toolkit☆279Updated 5 years ago
- Apache Airflow integration for dbt☆412Updated last year
- Data ingestion library for Amundsen to build graph and search index☆204Updated last year
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.☆267Updated 9 months ago
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.☆186Updated 2 years ago
- Guides and docs to help you get up and running with Apache Airflow.☆815Updated last month
- Assets related to the operation of Fishtown Analytics.☆419Updated last year
- Schema modelling framework for decentralised domain-driven ownership of data.☆260Updated 2 years ago
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆509Updated 4 months ago
- dbt + Metabase integration☆565Updated this week
- Auto-generated Diagrams from Airflow DAGs. 🔮 🪄☆354Updated last week
- 🚎 Notebook sharing hub☆501Updated 2 years ago