snowplow-archive / snowplow-snowflake-loader
Loads Snowplow enriched events from S3 into Snowflake
☆11Updated last year
Alternatives and similar repositories for snowplow-snowflake-loader:
Users that are interested in snowplow-snowflake-loader are comparing it to the libraries listed below
- Stores Snowplow enriched events in Redshift, Snowflake and Databricks☆31Updated 2 weeks ago
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR☆19Updated last year
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data.☆41Updated 3 months ago
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated 7 months ago
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al.☆21Updated 5 months ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- Run templatable playbooks of SQL scripts in series and parallel on Redshift, PostgreSQL, BigQuery and Snowflake☆81Updated last year
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆75Updated 2 years ago
- Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow☆30Updated 8 years ago
- Snowplow Enrichment jobs and library☆22Updated last month
- Collector for cloud-native web, mobile and event analytics, running on AWS and GCP☆31Updated 2 weeks ago
- SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon☆60Updated 4 years ago
- A decisioning and response platform☆70Updated 3 years ago
- This document attempts to capture useful patterns and warn about subtle gotchas when it comes to designing and evolving schemas for long-…☆13Updated 7 years ago
- Load testing for event analytics platforms (Snowplow, more coming soon)☆13Updated 8 years ago
- Hive Storage Handler for Kinesis.☆11Updated 9 years ago
- A Kafka-Connect Sink for S3 with no Hadoop dependencies.☆57Updated 2 years ago
- Data models for snowplow analytics.☆128Updated 2 months ago
- Autoscaling EMR clusters and Kinesis streams on Amazon Web Services (AWS)☆47Updated last year
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.☆111Updated 5 years ago
- Performant Redshift data source for Apache Spark☆138Updated 3 months ago
- ☆33Updated last year
- Deploy Presto on the cloud easily, using Terraform and Packer☆44Updated 2 years ago
- Unix tee, but for Kinesis streams☆12Updated 3 years ago
- A CLI and library to run Singer Taps and Targets☆34Updated 3 years ago
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)☆95Updated 6 years ago
- Repository for advanced unit-testing with embedded kafka services☆25Updated 6 years ago
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Updated last year
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆50Updated last year
- JSON schema parser for Apache Spark☆81Updated 2 years ago