allegro / bigflow
A Python framework for data processing on GCP.
☆117 Updated last month
Related projects
Alternatives and complementary repositories for bigflow
- Fast iterative local development and testing of Apache Airflow workflows ☆193 Updated 5 months ago
- Astronomer Core Docker Images ☆106 Updated 6 months ago
- Pylint plugin for static code analysis on Airflow code ☆90 Updated 4 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 Updated 11 months ago
- Airflow declarative DAGs via YAML ☆131 Updated last year
- ☆196 Updated last year
- ETLy is an add-on dashboard service on top of Apache Airflow. ☆69 Updated last year
- Airflow Unit Tests and Integration Tests ☆256 Updated 2 years ago
- Visualize dependencies between Airflow DAGs ☆49 Updated 3 years ago
- Composable filesystem hooks and operators for Apache Airflow. ☆17 Updated 3 years ago
- Great Expectations Airflow operator ☆159 Updated 3 weeks ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆63 Updated 2 years ago
- A command-line tool for managing permissions and dependencies for BigQuery authorized views ☆88 Updated 2 years ago
- The Picnic Data Vault framework. ☆126 Updated 5 months ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning ☆37 Updated this week
- Apache (Py)Spark type annotations (stub files). ☆115 Updated 2 years ago
- Oozie Workflow to Airflow DAGs migration tool ☆87 Updated 3 weeks ago
- Airflow Backfill UI based plugin for existing / new Airflow environment ☆66 Updated 3 years ago
- makes your sql less bad ☆60 Updated 4 years ago
- Kedro Plugin to support running pipelines on Kubernetes using Airflow. ☆29 Updated last year
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables. ☆377 Updated this week
- ☆48 Updated 4 months ago
- A repository of sample code to accompany our blog post on Airflow and dbt. ☆168 Updated last year
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform ☆262 Updated last year
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables ☆71 Updated last year
- ✨ A Pydantic to PySpark schema library ☆57 Updated this week
- Define, govern, and model event data for warehouse-first product analytics. ☆82 Updated 4 months ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com) ☆217 Updated this week
- dbt's adapter for dremio ☆48 Updated 2 years ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆62 Updated last month