snowplow / stream-collector
Collector for cloud-native web, mobile and event analytics, running on AWS and GCP
☆33 Updated 2 months ago
Alternatives and similar repositories for stream-collector
Users that are interested in stream-collector are comparing it to the libraries listed below
- Snowplow Enrichment jobs and library ☆24 Updated last month
- Stores Snowplow enriched events in Redshift, Snowflake and Databricks ☆31 Updated 2 months ago
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data. ☆42 Updated 5 months ago
- Dione - a Spark and HDFS indexing library ☆52 Updated last year
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR ☆19 Updated last year
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark ☆29 Updated 7 months ago
- Extensible streaming ingestion pipeline on top of Apache Spark ☆45 Updated last week
- ☆22 Updated 6 years ago
- Loads Snowplow enriched events from S3 into Snowflake ☆11 Updated last year
- Mirrors a Kinesis stream to Amazon S3 using the KCL ☆42 Updated 9 months ago
- Snowflake Kafka Connector (Sink Connector) ☆158 Updated last week
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al. ☆21 Updated 7 months ago
- Standalone application to automate testing of trackers ☆50 Updated 3 weeks ago
- Adapter for dbt that executes dbt pipelines on Apache Flink ☆95 Updated last year
- ☆80 Updated 2 months ago
- Data Sketches for Apache Spark ☆22 Updated 2 years ago
- SparkSQL utils for ScalaPB ☆43 Updated 2 weeks ago
- The ZetaSQL Toolkit is a library that helps users use ZetaSQL Java API to perform SQL analysis for multiple query engines, including BigQ… ☆41 Updated 2 weeks ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating) ☆61 Updated 6 months ago
- Deploy Presto on the cloud easily, using Terraform and Packer ☆45 Updated 2 years ago
- Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs ☆21 Updated last week
- Scalable CDC Pattern Implemented using PySpark ☆18 Updated 5 years ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are in… ☆89 Updated last month
- Aiven's S3 Sink Connector for Apache Kafka® ☆70 Updated 9 months ago
- Command-line app for tracking Snowplow events. Add analytics to your shell scripts and terminal sessions ☆9 Updated last year
- Scala API for Apache Spark SQL high-order functions ☆14 Updated last year
- a curated list of awesome lakehouse frameworks, applications, etc ☆32 Updated 4 months ago
- Traffic routing for Trino Clusters ☆27 Updated 2 weeks ago
- Multi-hop declarative data pipelines ☆115 Updated 2 weeks ago
- A testing framework for Trino ☆26 Updated 3 months ago