mohaseeb / beam-nuggets
Collection of transforms for the Apache Beam Python SDK.
☆89 Updated last year
Alternatives and similar repositories for beam-nuggets:
Users who are interested in beam-nuggets compare it to the libraries listed below.
- Generates the BigQuery schema from newline-delimited JSON or CSV data records. ☆241 Updated last year
- A curated list of awesome resources for Apache Beam ☆146 Updated 2 years ago
- Airflow Unit Tests and Integration Tests ☆256 Updated 2 years ago
- ☆196 Updated last year
- A command-line tool for managing permissions and dependencies for BigQuery authorized views ☆89 Updated 2 years ago
- ☆46 Updated 8 months ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks ☆146 Updated 8 years ago
- ☆78 Updated this week
- Dataproc templates and pipelines for solving simple in-cloud data tasks ☆122 Updated this week
- Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow. ☆143 Updated 7 months ago
- ☆118 Updated last week
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables. ☆381 Updated 3 weeks ago
- Apache Avro <-> pandas DataFrame ☆136 Updated 5 months ago
- Astronomer Core Docker Images ☆106 Updated 7 months ago
- Pylint plugin for static code analysis on Airflow code ☆91 Updated 4 years ago
- Fast iterative local development and testing of Apache Airflow workflows ☆195 Updated last month
- Cloud-native, data onboarding architecture for Google Cloud Datasets ☆155 Updated this week
- Commons code used by the Data Catalog connectors, and links for the connectors sample code. ☆61 Updated 3 years ago
- Data Quality Engine for BigQuery ☆263 Updated 6 months ago
- ☆50 Updated 4 years ago
- BigQuery test kit is a framework written in Python that allows you to be more confident in your SQL and check that they are ready to prod… ☆52 Updated 11 months ago
- A Python in-memory test stub for BigQuery ☆143 Updated 2 years ago
- triggering a DAG run multiple times ☆85 Updated 10 months ago
- SQLAlchemy dialect for BigQuery ☆442 Updated this week
- Cloud Build for Deploying Datapipelines with Composer, Dataflow and BigQuery ☆64 Updated 4 years ago
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP ☆90 Updated 5 months ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform ☆261 Updated last year
- pytest plugin to run the tests with support of pyspark ☆84 Updated 10 months ago
- Cloud Dataproc: Samples and Utils ☆199 Updated last week
- ☆127 Updated 4 years ago