rsyi / whaleLinks
๐ณ The stupidly simple CLI workspace for your data warehouse.
โ728Updated 2 years ago
Alternatives and similar repositories for whale
Users that are interested in whale are comparing it to the libraries listed below
Sorting:
- re_data - fix data issues before your users & CEO would discover them ๐โ1,566Updated last year
- What's in your data? Extract schema, statistics and entities from datasetsโ1,499Updated 3 months ago
- Writes the Singer format from Pythonโ568Updated 3 weeks ago
- Tool to automate data quality checks on data pipelinesโ254Updated 2 years ago
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning mโฆโ853Updated last year
- Write python locally, execute SQL in your data warehouseโ270Updated 3 years ago
- Scalable identity resolution, entity resolution, data mastering and deduplication using MLโ1,061Updated this week
- Data Pipeline Framework using the singer.io specโ649Updated this week
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!โ580Updated this week
- Repository for the ActivitySchema spec and supporting materialsโ420Updated 2 years ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.ioโ2,142Updated this week
- Generate and Visualize Data Lineage from query historyโ326Updated last year
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.โ266Updated 3 months ago
- ๐ Notebook sharing hubโ499Updated last year
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.โ185Updated 2 years ago
- The metrics layer for your data. Join us at https://metriql.com/slackโ309Updated 2 years ago
- MetricFlow allows you to define, build, and maintain metrics in code.โ1,235Updated this week
- Open source data observability platformโ326Updated 2 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.โ126Updated 3 years ago
- Assets related to the operation of Fishtown Analytics.โ418Updated 9 months ago
- Monitor the stability of a Pandas or Spark dataframe โ๏ธโ503Updated 5 months ago
- Python API for Deequโ787Updated 3 months ago
- Guides and docs to help you get up and running with Apache Airflow.โ807Updated 2 years ago
- Auto-generated Diagrams from Airflow DAGs. ๐ฎ ๐ชโ344Updated last week
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.โ169Updated last year
- Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bโฆโ796Updated 2 years ago
- Schema modelling framework for decentralised domain-driven ownership of data.โ254Updated last year
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamiltonโ859Updated 2 years ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewโฆโ2,094Updated 3 months ago
- dbt + Metabase integrationโ535Updated 2 weeks ago