Memrise / docker-snowplow
Snowplow docker containers
☆8Updated 8 years ago
Alternatives and similar repositories for docker-snowplow:
Users that are interested in docker-snowplow are comparing it to the libraries listed below
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al.☆20Updated 2 months ago
- Docker images for Snowplow, Iglu and associated projects☆61Updated 3 years ago
- A cookiecutter template for Apache Spark applications written in Scala☆10Updated 6 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 4 years ago
- Python SDK for working with Snowplow enriched events in Spark, AWS Lambda et al.☆21Updated 2 months ago
- Load testing for event analytics platforms (Snowplow, more coming soon)☆13Updated 8 years ago
- A plugin for Airflow that create and manage your DAG with web UI.☆20Updated 7 years ago
- Example for an airflow plugin☆49Updated 8 years ago
- High Level Kafka Scanner☆19Updated 7 years ago
- An Ansible role for installing Apache Spark.☆58Updated 6 years ago
- Open source analytics platform powered by Apache Cassandra, Spark, and Kafka☆34Updated 9 years ago
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated 4 months ago
- CLI tool to launch Spark jobs on AWS EMR☆67Updated last year
- Stores Snowplow enriched events in Redshift, Snowflake and Databricks☆31Updated last week
- Apache Spark AWS Lambda Executor (SAMBA)☆44Updated 6 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- Helpers & syntactic sugar for PySpark.☆61Updated last year
- Python language Plugin for elasticsearch☆103Updated 6 years ago
- Export Airflow metrics (from mysql) in prometheus format☆29Updated 2 years ago
- Visualize streaming machine learning in Spark☆9Updated 9 years ago
- Load a CSV (or TSV) file into an Elasticsearch instance☆62Updated 2 years ago
- Helpful tools for monitoring Kafka Connect☆20Updated 6 years ago
- Restrict crawl and scraping scope using matchers.☆25Updated 8 years ago
- A client for the Confluent Schema Registry API implemented in Python☆52Updated last year
- The Scalding tutorial as a standalone SBT project☆51Updated 7 years ago
- A DockerSwarm Jupyterhub setup, which uses a NFS Server running in a Docker Container for persistent storage☆20Updated 6 years ago
- Tool to flatten stream of JSON-like objects, configured via schema☆33Updated 5 years ago
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR☆19Updated 10 months ago
- Cloudbreak Deployer Tool☆34Updated last year
- Provides a Pythonic interface for reading and writing Avro schemas☆27Updated 2 years ago