pacuna / snowplow-pipelineLinks
End-to-end Snowplow Analytics Pipeline for real time events
☆29Updated 2 years ago
Alternatives and similar repositories for snowplow-pipeline
Users that are interested in snowplow-pipeline are comparing it to the libraries listed below
Sorting:
- Helm Charts for the Astronomer Platform, Apache Airflow as a Service on Kubernetes☆483Updated this week
- A guide to running Airflow on Kubernetes☆173Updated 6 years ago
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data.☆42Updated 9 months ago
- Docker image for dbt (data build tool).☆50Updated 3 years ago
- Fast iterative local development and testing of Apache Airflow workflows☆201Updated last month
- ☆127Updated 5 years ago
- Data models for snowplow analytics.☆129Updated 7 months ago
- SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon☆60Updated 4 years ago
- Adapter for dbt that executes dbt pipelines on Apache Flink☆95Updated last year
- Loads Snowplow enriched events into Google BigQuery☆22Updated 5 months ago
- This repo helps bootstrap the infrastructures with a modern data stack on Google Cloud Platform using Terraform.☆119Updated 3 years ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆76Updated 4 years ago
- Airflow Backfill UI based plugin for existing / new Airflow environment☆65Updated 4 years ago
- A Helm chart to install Apache Airflow on Kubernetes☆289Updated this week
- The athena adapter plugin for dbt (https://getdbt.com)☆140Updated 2 years ago
- Apache Airflow integration for dbt☆406Updated last year
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆260Updated 2 years ago
- Tool to automate data quality checks on data pipelines☆254Updated 3 years ago
- Data ingestion library for Amundsen to build graph and search index☆204Updated last year
- Contains all JSON Schemas, Avros and Thrifts for Iglu Central☆122Updated 3 weeks ago
- ☆202Updated 2 years ago
- Performant Redshift data source for Apache Spark☆142Updated 3 months ago
- Multiple node presto cluster on docker container☆126Updated 3 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆96Updated last week
- Airflow Unit Tests and Integration Tests☆260Updated 2 years ago
- Example DAGs using hooks and operators from Airflow Plugins☆347Updated 7 years ago
- Docker images for Snowplow, Iglu and associated projects☆61Updated 4 years ago
- How to Automate SQL: dbt(data build tool) tutorial on bigquery with extensive NOTES☆33Updated 2 years ago
- Data Pipeline Framework using the singer.io spec☆654Updated last week
- ☆80Updated 5 months ago