snowplow / stream-collectorLinks
Collector for cloud-native web, mobile and event analytics, running on AWS and GCP
☆35Updated last month
Alternatives and similar repositories for stream-collector
Users that are interested in stream-collector are comparing it to the libraries listed below
Sorting:
- Snowplow Enrichment jobs and library☆26Updated 2 weeks ago
- Stores Snowplow enriched events in Redshift, Snowflake and Databricks☆30Updated 7 months ago
- Adapter for dbt that executes dbt pipelines on Apache Flink☆96Updated last year
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data.☆42Updated 10 months ago
- SparkSQL utils for ScalaPB☆43Updated 5 months ago
- Kafka Connector for Iceberg tables☆16Updated 2 years ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆83Updated 7 months ago
- a curated list of awesome lakehouse frameworks, applications, etc☆36Updated last week
- ☆80Updated 6 months ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆29Updated last year
- A library that provides useful extensions to Apache Spark and PySpark.☆230Updated last week
- A library that brings useful functions from various modern database management systems to Apache Spark☆60Updated 2 years ago
- Avro SerDe for Apache Spark structured APIs.☆236Updated 5 months ago
- Snowflake Kafka Connector (Sink Connector)☆160Updated this week
- A leightweight UI for Lakekeeper☆15Updated last week
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆94Updated 6 months ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆346Updated last year
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.☆113Updated 5 years ago
- Multi-hop declarative data pipelines☆122Updated this week
- Extensible streaming ingestion pipeline on top of Apache Spark☆46Updated 3 months ago
- Snowflake Data Source for Apache Spark.☆230Updated 3 weeks ago
- JSON schema parser for Apache Spark☆82Updated 3 years ago
- BigQuery connector for Apache Flink☆33Updated last week
- A dbt adapter for Decodable☆12Updated 2 months ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆292Updated last week
- Library to convert DBT manifest metadata to Airflow tasks☆49Updated last year
- Helm charts for Trino and Trino Gateway☆184Updated last week
- Scala + Druid: Scruid. A library that allows you to compose queries in Scala, and parse the result back into typesafe classes.☆115Updated 4 years ago
- A tool to validate data, built around Apache Spark.☆100Updated this week
- Apache Spark build compatible with AWS Glue Data Catalog.☆19Updated 4 years ago