Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.
☆535 · Jan 27, 2026 · Updated last month
Alternatives and similar repositories for eventsim
Users who are interested in eventsim are comparing it to the libraries listed below.
- Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic. ☆94 · Jan 21, 2024 · Updated 2 years ago
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more! ☆857 · Apr 16, 2022 · Updated 3 years ago
- Skeleton project for Apache Airflow training participants to work on. ☆17 · Jul 9, 2020 · Updated 5 years ago
- ☆143 · May 23, 2023 · Updated 2 years ago
- Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic. ☆28 · Nov 11, 2017 · Updated 8 years ago
- Streaming analytics project with eventsim and Kafka ☆13 · Dec 23, 2022 · Updated 3 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines ☆30 · Feb 1, 2016 · Updated 10 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase ☆14 · Mar 23, 2016 · Updated 9 years ago
- Beginner data engineering project - batch edition ☆565 · Jan 22, 2025 · Updated last year
- ☆76 · May 19, 2015 · Updated 10 years ago
- Streaming Data Simulator ☆17 · Oct 12, 2020 · Updated 5 years ago
- Utilities for converting to and from JSON from Avro records via Hadoop streaming or Hive. ☆29 · Oct 13, 2020 · Updated 5 years ago
- reactive kafka client ☆161 · Oct 31, 2020 · Updated 5 years ago
- Slides for our 2015 event ☆13 · May 14, 2015 · Updated 10 years ago
- ☆11 · Dec 10, 2015 · Updated 10 years ago
- CLI tool to launch Spark jobs on AWS EMR ☆67 · Oct 18, 2023 · Updated 2 years ago
- Secor is a service implementing Kafka log persistence ☆1,858 · Feb 25, 2026 · Updated last week
- docker image to deploy rabbitmq cluster on mesos with one marathon app ☆10 · Oct 12, 2017 · Updated 8 years ago
- Web-links shortener on erlang ☆12 · Aug 22, 2011 · Updated 14 years ago
- Simplify getting Zeppelin up and running ☆56 · Jul 20, 2016 · Updated 9 years ago
- Distributed Prometheus time series database ☆1,461 · Mar 3, 2026 · Updated last week
- Mahout vector encoding for pig ☆53 · Nov 20, 2022 · Updated 3 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos ☆19 · Sep 8, 2016 · Updated 9 years ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what… ☆96 · Nov 14, 2019 · Updated 6 years ago
- Prescriptive Applications over Kite and Hadoop ☆12 · Oct 14, 2015 · Updated 10 years ago
- Conway's Game of Life implemented in Scala.js ☆10 · Mar 30, 2018 · Updated 7 years ago
- An integration framework that allows you to run and manage CrateDB via Apache Mesos. ☆23 · Jan 30, 2019 · Updated 7 years ago
- The Data Engineering Cookbook ☆14,977 · Jan 17, 2026 · Updated last month
- The leader in Customer Data Infrastructure ☆7,001 · Updated this week
- A/B test analysis library for Ruby - performs Chi-Square tests and G-tests on A/B results ☆40 · Jun 27, 2014 · Updated 11 years ago
- Complete Pipeline Training at Big Data Scala By the Bay ☆71 · Oct 27, 2015 · Updated 10 years ago
- A list of useful resources to learn Data Engineering from scratch ☆3,960 · Jun 19, 2024 · Updated last year
- Apache Pinot - A realtime distributed OLAP datastore ☆6,040 · Updated this week
- Scripts for generating Grafana dashboards for monitoring Spark jobs ☆242 · Mar 26, 2015 · Updated 10 years ago
- Large-scale event processing with Akka Persistence and Apache Spark ☆273 · Jun 18, 2016 · Updated 9 years ago
- End to end data engineering project ☆58 · Oct 27, 2022 · Updated 3 years ago
- ☆201 · Oct 10, 2023 · Updated 2 years ago
- A reactive time series database ☆235 · Jun 5, 2018 · Updated 7 years ago
- Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme… ☆1,831 · Aug 26, 2022 · Updated 3 years ago