A Python framework for data processing on GCP.
☆120 Apr 9, 2025 Updated 10 months ago
Alternatives and similar repositories for bigflow
Users who are interested in bigflow compare it to the libraries listed below.
- Camus Compressor merges files created by Camus and saves them in a compressed format. ☆13 Mar 20, 2023 Updated 2 years ago
- CLI for data platform ☆21 Nov 12, 2025 Updated 3 months ago
- ☆25 Feb 25, 2026 Updated last week
- Historical metadata of your data warehouse is a treasure trove to discover not just insights about changing data patterns, but also quali… ☆13 Jul 21, 2021 Updated 4 years ago
- A tool for ~/.ssh/config generation ☆50 Aug 26, 2025 Updated 6 months ago
- Sample code for demonstrating and exploring class loader related memory leaks ☆15 Mar 29, 2018 Updated 7 years ago
- Kedro Snowflake / Snowpark plugin ☆14 Jul 19, 2024 Updated last year
- Data Quality Engine for BigQuery ☆280 May 19, 2025 Updated 9 months ago
- GCP Workflows visual editor ☆15 Oct 31, 2021 Updated 4 years ago
- ☆19 Apr 6, 2022 Updated 3 years ago
- Learning Google BigQuery, published by Packt ☆15 Jan 30, 2023 Updated 3 years ago
- Gradle plugin with integrationTest task ☆93 Feb 1, 2026 Updated last month
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-run ☆10 Oct 31, 2023 Updated 2 years ago
- A small library of hive UDFS using Macros to process and manipulate complex types ☆15 Oct 2, 2025 Updated 5 months ago
- Ambari and Cloudera Manager in Docker ☆22 Mar 7, 2019 Updated 6 years ago
- Compile JSON Schema into Avro and BigQuery schemas ☆45 Updated this week
- Integrates Marathon apps with Consul service discovery. ☆198 Oct 29, 2025 Updated 4 months ago
- This repository contains an example of how to leverage Cloud Composer and Cloud Dataflow to move data from a Microsoft SQL Server to BigQ… ☆19 Jun 10, 2025 Updated 8 months ago
- Centralized whale instance using github actions, sourcing metadata from bigquery-public-data. ☆17 Jun 15, 2024 Updated last year
- End-to-end DataOps platform deployed by Terraform. ☆69 Mar 22, 2025 Updated 11 months ago
- Library to convert DBT manifest metadata to Airflow tasks ☆49 Dec 23, 2025 Updated 2 months ago
- Bigquery ETL ☆329 Updated this week
- Business intelligence, data exploration and visualization web application for Druid, formerly known as Swiv and Pivot ☆762 Nov 24, 2025 Updated 3 months ago
- ☆21 Mar 17, 2023 Updated 2 years ago
- ☆20 Oct 10, 2021 Updated 4 years ago
- Tag Engine automates the process of creating, updating, deleting, and populating metadata in bulk with the Google Cloud services Data Cat… ☆62 Jan 5, 2026 Updated 2 months ago
- A serverless bot which periodically checks configured BigQuery capacity commitments, reservations and assignments against actual slot con… ☆26 Jan 12, 2026 Updated last month
- Python code to grab data from Google Search Console and send it to BigQuery. ☆51 Jun 12, 2025 Updated 8 months ago
- Data structures & algorithms implemented in Java and solutions to leetcode problems. ☆16 Mar 18, 2024 Updated last year
- Supercharge BigQuery with BigFunctions ☆763 Oct 17, 2025 Updated 4 months ago
- ☆26 Mar 18, 2016 Updated 9 years ago
- Big Data Newsletter ☆23 Apr 12, 2024 Updated last year
- GCE Rescue is a command-line tool to boot Google Cloud Platform VMs in Rescue Mode. ☆14 Dec 22, 2023 Updated 2 years ago
- Schedule a data pipeline in Google Cloud using cloud function, BigQuery, cloud storage, cloud scheduler, stack trace, cloud build, and p… ☆26 Jun 4, 2019 Updated 6 years ago
- Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag ☆23 Sep 19, 2022 Updated 3 years ago
- Convert JSON schema to Google BigQuery schema ☆26 Feb 25, 2026 Updated last week
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes ☆53 Jun 11, 2020 Updated 5 years ago
- the open-source product analytics tool for the modern data stack ☆28 Oct 27, 2022 Updated 3 years ago
- Utility to identify and rewrite common anti patterns in BigQuery SQL syntax ☆115 Aug 26, 2025 Updated 6 months ago