snowplow / snowplow-rdb-loader
Stores Snowplow enriched events in Redshift, Snowflake and Databricks
☆31Updated 2 months ago
Alternatives and similar repositories for snowplow-rdb-loader:
Users that are interested in snowplow-rdb-loader are comparing it to the libraries listed below
- Loads Snowplow enriched events from S3 into Snowflake☆11Updated last year
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated 6 months ago
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al.☆20Updated 4 months ago
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data.☆41Updated 2 months ago
- JSON schema parser for Apache Spark☆81Updated 2 years ago
- Kinesis Connector for Structured Streaming☆136Updated 8 months ago
- Domain-specific language to help build and maintain AWS Data Pipelines☆26Updated 6 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆72Updated 4 years ago
- Examples for High Performance Spark☆15Updated 4 months ago
- Google Spreadsheets datasource for SparkSQL and DataFrames☆57Updated last year
- Collector for cloud-native web, mobile and event analytics, running on AWS and GCP☆31Updated 2 weeks ago
- type-class based data cleansing library for Apache Spark SQL☆78Updated 5 years ago
- SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon☆60Updated 4 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…☆109Updated 7 years ago
- Template for Spark Projects☆101Updated 10 months ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆30Updated 4 months ago
- Scala + Druid: Scruid. A library that allows you to compose queries in Scala, and parse the result back into typesafe classes.☆115Updated 3 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- Snowplow Enrichment jobs and library☆22Updated last month
- A Giter8 template for scio☆31Updated last month
- Scala client for MaxMind Geo-IP☆86Updated last year
- Extensible streaming ingestion pipeline on top of Apache Spark☆44Updated last year
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆118Updated 8 years ago
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.☆75Updated 11 months ago
- Data models for snowplow analytics.☆127Updated last month
- Spark connector for SFTP☆100Updated last year
- File compaction tool that runs on top of the Spark framework.☆59Updated 5 years ago
- ☆72Updated 4 years ago
- Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow☆30Updated 8 years ago
- Big Data Toolkit for the JVM☆146Updated 4 years ago