cloudwicklabs / generatorLinks
Synthetic data generators for simulating real-time data and work loads
☆11Updated 9 years ago
Alternatives and similar repositories for generator
Users that are interested in generator are comparing it to the libraries listed below
Sorting:
- Cloudera Director sample code☆61Updated 5 years ago
- Automates Spark standalone cluster tasks with Puppet and Fabric.☆43Updated 10 years ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆94Updated 4 years ago
- HDF masterclass materials☆28Updated 9 years ago
- Monitor Twitter stream for S&P 500 companies to identify & act on unexpected increases in tweet volume☆39Updated 9 years ago
- real time log event processing using spark, kafka & cassandra☆13Updated 10 years ago
- A DC/OS time series demo☆62Updated 9 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67Updated 5 years ago
- Docker image for apache zeppelin☆38Updated 8 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆118Updated 9 years ago
- Materials for various Hadoop & Nifi related workshops☆51Updated 6 years ago
- Coding exercises for Apache Spark☆104Updated 10 years ago
- ☆24Updated 9 years ago
- Amazon Elastic MapReduce code samples☆63Updated 9 years ago
- Single view demo☆14Updated 9 years ago
- An example Apache Beam project.☆111Updated 8 years ago
- Demonstrates NiFi template deployment and configuration via a REST API☆70Updated 8 years ago
- ☆48Updated 9 years ago
- Code to index Hive tables to Solr and Solr indexes to Hive☆48Updated 6 years ago
- XML Serializer/Deserializer for Apache Hive☆41Updated 5 years ago
- Workshop for Hadoop Operations Best Practices☆10Updated 10 years ago
- Code and setup information for Introduction to Machine Learning with Spark☆12Updated 9 years ago
- This project is for examples of how to use Zeppelin. https://github.com/apache/incubator-zeppelin☆25Updated 9 years ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.☆42Updated last year
- Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka and Cassandra☆85Updated 8 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 5 years ago
- Tools to deploy Hadoop on EMC Isilon☆17Updated 8 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- NiFi provenance reporting tasks☆14Updated last year