pacuna / snowplow-pipeline
End-to-end Snowplow Analytics Pipeline for real time events
☆29 Updated 2 years ago
Alternatives and similar repositories for snowplow-pipeline
Users who are interested in snowplow-pipeline are comparing it to the libraries listed below.
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data. ☆42 Updated 6 months ago
- SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon ☆60 Updated 4 years ago
- Data models for snowplow analytics. ☆129 Updated 5 months ago
- Deploy Presto on the cloud easily, using Terraform and Packer ☆45 Updated 2 years ago
- Docker images for Snowplow, Iglu and associated projects ☆61 Updated 4 years ago
- Iglu is a machine-readable, open-source schema repository for JSON Schema from the team at Snowplow ☆211 Updated last month
- Contains all JSON Schemas, Avros and Thrifts for Iglu Central ☆122 Updated this week
- Mirrors a Kinesis stream to Amazon S3 using the KCL ☆42 Updated last week
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor… ☆69 Updated 4 months ago
- A CLI and library to run Singer Taps and Targets ☆34 Updated 3 years ago
- tap-postgres ☆68 Updated 10 months ago
- Docker image for dbt (data build tool). ☆49 Updated 3 years ago
- Fast iterative local development and testing of Apache Airflow workflows ☆202 Updated 2 months ago
- An easily-deployable, single-instance version of Snowplow ☆129 Updated last month
- Standalone application to automate testing of trackers ☆50 Updated last month
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector ☆152 Updated last year
- ETLy is an add-on dashboard service on top of Apache Airflow. ☆69 Updated last year
- Apiary provides modules which can be combined to create a federated cloud data lake ☆36 Updated last year
- A guide to running Airflow on Kubernetes ☆173 Updated 6 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform ☆260 Updated last year
- Multiple node presto cluster on docker container ☆123 Updated 3 years ago
- Sample Airflow DAGs ☆62 Updated 2 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi… ☆96 Updated this week
- A Getting Started Guide for developing and using Airflow Plugins ☆93 Updated 6 years ago
- Quickly get a kubernetes executor airflow environment provisioned on GKE. Azure Kubernetes Service instructions included also as are inst… ☆36 Updated 5 years ago
- Run templatable playbooks of SQL scripts in series and parallel on Redshift, PostgreSQL, BigQuery and Snowflake ☆81 Updated 2 months ago
- Airflow configuration for Telemetry ☆192 Updated this week
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 Updated last year
- ❤ for real-time DataOps - where the application and data fabric blends - Lenses ☆159 Updated 2 weeks ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub ☆37 Updated 7 years ago