wayfair-incubator / avro-to-bigquery
☆11Updated 2 weeks ago
Related projects: ⓘ
- Historical metadata of your data warehouse is a treasure trove to discover not just insights about changing data patterns, but also quali…☆13Updated 3 years ago
- BigQuery UDF Marshall/Unmarshall Protocolbuffers☆10Updated last year
- Contains example dags and terraform code to create a composer with a node pool to run pods☆13Updated 3 years ago
- GCP Workflows visual editor☆13Updated 2 years ago
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data.☆41Updated 6 months ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and…☆28Updated last year
- Stream Avro SpecificRecord objects in BigQuery using Cloud Dataflow☆13Updated 2 years ago
- Sample code with integration between Data Catalog and Hive data source.☆25Updated 4 months ago
- An open source library for BigQuery testing.☆14Updated 2 years ago
- How to Automate SQL: dbt(data build tool) tutorial on bigquery with extensive NOTES☆30Updated 11 months ago
- Loads Snowplow enriched events into Google BigQuery☆21Updated this week
- This tool generates emulated data stream based on the NYC Taxi & Limousine Commission’s open dataset expanded with additional routing inf…☆13Updated 6 years ago
- The ZetaSQL Toolkit is a library that helps users use ZetaSQL Java API to perform SQL analysis for multiple query engines, including BigQ…☆38Updated this week
- ☆33Updated 5 months ago
- A tool to import large datasets to BigQuery with automatic schema detection.☆26Updated 5 years ago
- ☆46Updated 4 months ago
- Convert JSON schema to Google BigQuery schema☆23Updated last week
- ffmpeg for market data☆35Updated this week
- Repo with scripts and automation to help ensure best practices in Google Data Catalog☆12Updated 2 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 7 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-bigquery-reservation☆22Updated 11 months ago
- Running Python Code in BigQuery UDFs☆23Updated 3 years ago
- Sample code with integration between Data Catalog and BI data sources.☆32Updated 2 years ago
- This repository contains an example of how to leverage Cloud Composer and Cloud Dataflow to move data from a Microsoft SQL Server to BigQ…☆17Updated 3 months ago
- The sane way of building a data layer in Airflow☆24Updated 4 years ago
- ☆11Updated last month
- BigQuery Google Storage Based Data Loader☆56Updated 4 months ago
- BigQuery Schema Conversion Tool☆23Updated 3 years ago
- A Github API client to extract events and actions, and load into a database☆28Updated 2 years ago
- Cloud Spanner Connector for Apache Spark☆17Updated last month