cloudwicklabs / generator
Synthetic data generators for simulating real-time data and work loads
☆11Updated 9 years ago
Alternatives and similar repositories for generator:
Users that are interested in generator are comparing it to the libraries listed below
- real time log event processing using spark, kafka & cassandra☆13Updated 10 years ago
- Ansible playbooks for Apache Spark on kube☆27Updated 7 years ago
- Automates Spark standalone cluster tasks with Puppet and Fabric.☆43Updated 10 years ago
- ☆24Updated 8 years ago
- ☆21Updated 9 years ago
- Hadoop Data Pipeline using Falcon☆15Updated 8 years ago
- A real time streaming implementation of markov chain based fraud detection☆23Updated 10 years ago
- Cloudera Director sample code☆61Updated 5 years ago
- A simple Python client to request NiFi REST API☆14Updated 6 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- HDF masterclass materials☆28Updated 9 years ago
- A collection of datasets and databases☆24Updated 6 years ago
- ☆7Updated 9 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Updated 9 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- Tools to deploy Hadoop on EMC Isilon☆18Updated 8 years ago
- Apache Pig plugin for Eclipse☆12Updated 8 years ago
- A DC/OS time series demo☆62Updated 9 years ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆94Updated 4 years ago
- Single view demo☆14Updated 9 years ago
- Data pipeline automation tool☆26Updated last year
- Workshop for Hadoop Operations Best Practices☆10Updated 10 years ago
- NiFi provenance reporting tasks☆14Updated last year
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- Traverse HDFS without jvm startup delays and directory context!! Supports multiple HDFS hosts, command line history and tab completion.☆17Updated 8 years ago
- ☆14Updated 9 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆118Updated 9 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 8 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 4 years ago
- Tutorial on parsing Enron email to Avro and then explore the email set using Spark.☆52Updated 9 months ago