TristanBilot / bqfetch
A lightweight tool to quickly fetch tables from BigQuery as pandas DataFrames, using the BigQuery Storage API combined with multiprocessing
☆27Updated last year
Alternatives and similar repositories for bqfetch:
Users who are interested in bqfetch are comparing it to the libraries listed below
- ☆34Updated last month
- Experimental MLflow plugin for Google Cloud Vertex AI☆37Updated last year
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-bigquery-datatransfer☆84Updated last year
- Package for dbt that allows users to train, audit and use BigQuery ML models.☆69Updated 2 months ago
- ☆120Updated this week
- ☆13Updated 9 months ago
- Sample project that uses Dagster, dbt, DuckDB and Dash to visualize the Spanish car and motorcycle market☆58Updated 2 years ago
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level.☆55Updated last week
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆69Updated last year
- Extension dtypes for pandas corresponding to GoogleSQL data types such as DATE, TIME, and JSON.☆30Updated this week
- Makes it simple to store test results and visualise them in a BI dashboard☆44Updated last month
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 8 months ago
- This repository contains an example of how to leverage Cloud Composer and Cloud Dataflow to move data from a Microsoft SQL Server to BigQ…☆18Updated 11 months ago
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- ☆46Updated 9 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆168Updated last year
- [DEPRECATED] A dbt adapter for Excel.☆92Updated 3 weeks ago
- Data pipeline with dbt, Airflow, Great Expectations☆162Updated 3 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- Dry run capability for dbt projects using BigQuery☆97Updated 3 weeks ago
- Example Multi-Cycle, Multi-Touch Revenue and Cost Attribution Model☆26Updated last year
- ☆35Updated 4 months ago
- The shared semantic layer definitions that dbt-core and MetricFlow use.☆77Updated last month
- End-to-end DataOps platform deployed by Terraform.☆66Updated last month
- Pytest plugin for dbt core☆60Updated 3 months ago
- ☆41Updated last month
- Linear regression in SQL using dbt☆70Updated 3 months ago
- Schedule a data pipeline in Google Cloud using cloud function, BigQuery, cloud storage, cloud scheduler, stack trace, cloud build, and p…☆26Updated 5 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-analytics-data☆159Updated last year