Dataform is a framework for managing SQL based data operations in BigQuery
☆982Jun 5, 2026Updated this week
Alternatives and similar repositories for dataform
Users that are interested in dataform are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Dataform Tools - VS Code extension to run and visualise Dataform data pipelines and much more☆93Updated this week
- Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.☆1,300Jun 2, 2026Updated last week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆12,952Updated this week
- Data Contracts engine for the modern data stack. https://www.soda.io☆2,366Updated this week
- dataform-osmosis is a CLI tool for refactoring and managing Dataform SQLX files.☆17Oct 27, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆3,119Jun 1, 2026Updated last week
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-dataform☆13Jun 6, 2023Updated 3 years ago
- Compare tables within or across databases☆2,988May 17, 2024Updated 2 years ago
- Supercharge BigQuery with BigFunctions☆758Apr 13, 2026Updated last month
- MetricFlow allows you to define, build, and maintain metrics in code.☆1,613Updated this week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,771Jun 1, 2026Updated last week
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-host…☆2,355Updated this week
- re_data - fix data issues before your users & CEO would discover them 😊☆1,568Apr 30, 2024Updated 2 years ago
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr…☆2,532Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Provides automated YAML management and a streamlit workbench. Designed to optimize dev workflows.☆629Updated this week
- Data Pipeline Framework using the singer.io spec☆660May 21, 2026Updated 2 weeks ago
- Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both …☆21,397Updated this week
- Agentic BI. Analytics at the speed of code ⚡️☆5,872Updated this week
- Malloy is a modern open source language for describing data relationships and transformations.☆2,489Updated this week
- An Open Standard for lineage metadata collection☆2,497Updated this week
- An orchestration platform for the development, production, and observation of data assets.☆15,612Jun 1, 2026Updated last week
- Utility to identify and rewrite common anti patterns in BigQuery SQL syntax☆120Aug 26, 2025Updated 9 months ago
- This plugin works with SQLFluff, the SQL linter for humans, to correctly parse and compile SQL projects using Dataform.☆28May 31, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Always know what to expect from your data.☆11,548Updated this week
- GoogleSQL(formerly ZetaSQL) - Analyzer Framework for SQL☆2,630Jan 31, 2026Updated 4 months ago
- Data Quality Engine for BigQuery☆279Mar 27, 2026Updated 2 months ago
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning m…☆853Apr 5, 2024Updated 2 years ago
- 🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow. 🌊☆932Updated this week
- Collect, aggregate, and visualize a data ecosystem's metadata☆2,205Updated this week
- 📦 Serverless and local-first Open Data Platform☆312May 19, 2026Updated 2 weeks ago
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆125Feb 5, 2025Updated last year
- A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineer…☆587Feb 5, 2026Updated 4 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆37Jan 1, 2026Updated 5 months ago
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,463Updated this week
- Python SQL Parser and Transpiler☆9,301Updated this week
- The fastest business intelligence tool for humans and agents.☆2,644Updated this week
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files☆276May 29, 2026Updated last week
- dbt package to monitor BigQuery assets (tables & queries)☆108Updated this week
- A modular SQL linter and auto-formatter with support for multiple dialects and templated code.☆9,734Updated this week