wayfair-incubator / avro-to-bigquery
☆10Updated last week
Alternatives and similar repositories for avro-to-bigquery:
Users that are interested in avro-to-bigquery are comparing it to the libraries listed below
- Historical metadata of your data warehouse is a treasure trove to discover not just insights about changing data patterns, but also quali…☆13Updated 3 years ago
- An open source library for BigQuery testing.☆14Updated 2 years ago
- The ZetaSQL Toolkit is a library that helps users use ZetaSQL Java API to perform SQL analysis for multiple query engines, including BigQ…☆41Updated last week
- BigQuery UDF Marshall/Unmarshall Protocolbuffers☆10Updated 2 years ago
- A Github API client to extract events and actions, and load into a database☆28Updated 3 years ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and…☆28Updated last year
- Contains example dags and terraform code to create a composer with a node pool to run pods☆13Updated 4 years ago
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data.☆41Updated last month
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-bigquery-reservation☆22Updated last year
- Apache Beam examples for running on Google Cloud Dataflow.☆30Updated 6 years ago
- Stream Avro SpecificRecord objects in BigQuery using Cloud Dataflow☆13Updated 3 years ago
- BigQuery Google Storage Based Data Loader☆56Updated 8 months ago
- This tool generates emulated data stream based on the NYC Taxi & Limousine Commission’s open dataset expanded with additional routing inf…☆13Updated 6 years ago
- ☆22Updated 5 years ago
- ☆46Updated 9 months ago
- A tool to import large datasets to BigQuery with automatic schema detection.☆27Updated 5 years ago
- ☆13Updated 4 months ago
- A serverless bot which periodically checks configured BigQuery capacity commitments, reservations and assignments against actual slot con…☆23Updated 2 months ago
- Visualize dependencies between Airflow DAGs☆49Updated 3 years ago
- ffmpeg for market data☆35Updated this week
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- 💻 CLI for reporting events to Faros platform☆14Updated 3 months ago
- Repo with scripts and automation to help ensure best practices in Google Data Catalog☆13Updated 3 years ago
- Amundsen Gremlin☆21Updated 2 years ago
- A testing framework for Trino☆26Updated 2 months ago
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level.☆49Updated 3 weeks ago
- Composable filesystem hooks and operators for Apache Airflow.☆17Updated 3 years ago
- Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow☆42Updated last week
- A pyspark lib to validate data quality☆18Updated 2 years ago
- Python utilities for BigQuery analyses.☆15Updated 4 years ago