snowplow-archive / snowplow-docker
Docker images for Snowplow, Iglu and associated projects
☆61 Updated 4 years ago
Alternatives and similar repositories for snowplow-docker
Users who are interested in snowplow-docker are comparing it to the repositories listed below.
- An easily-deployable, single-instance version of Snowplow ☆129 Updated 2 weeks ago
- Iglu is a machine-readable, open-source schema repository for JSON Schema from the team at Snowplow ☆214 Updated this week
- Contains all JSON Schemas, Avros and Thrifts for Iglu Central ☆121 Updated this week
- Data models for snowplow analytics. ☆129 Updated last month
- A decisioning and response platform ☆70 Updated 4 years ago
- Mirrors a Kinesis stream to Amazon S3 using the KCL ☆42 Updated 6 months ago
- ☆54 Updated 8 years ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector ☆152 Updated last year
- Web Extension for debugging Snowplow pixels. ☆47 Updated last month
- SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon ☆60 Updated 5 years ago
- Standalone application to automate testing of trackers ☆52 Updated 2 weeks ago
- DBeam exports SQL tables into Avro files using JDBC and Apache Beam ☆194 Updated 2 months ago
- Divolte Collector ☆282 Updated 4 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform ☆259 Updated 2 years ago
- Run templatable playbooks of SQL scripts in series and parallel on Redshift, PostgreSQL, BigQuery and Snowflake ☆81 Updated 8 months ago
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data. ☆42 Updated last year
- Ephemeral Hadoop clusters using Google Compute Platform ☆134 Updated 3 years ago
- Google BigQuery support for Spark, SQL, and DataFrames ☆156 Updated 6 years ago
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al. ☆21 Updated last year
- A guide to running Airflow on Kubernetes ☆174 Updated 6 years ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks ☆152 Updated 9 years ago
- Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow ☆30 Updated 8 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow. ☆68 Updated 2 years ago
- Data ingestion library for Amundsen to build graph and search index ☆204 Updated last year
- AWS bootstrap scripts for Mozilla's flavoured Spark setup. ☆47 Updated 5 years ago
- "The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes. ☆269 Updated 2 years ago
- Front-end service library for Amundsen ☆278 Updated 2 months ago
- Docker image for dbt (data build tool). ☆50 Updated 3 years ago
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR ☆19 Updated this week
- A CLI and library to run Singer Taps and Targets ☆35 Updated 3 years ago