walmartlabs / mupd8
Muppet
☆126Updated 3 years ago
Alternatives and similar repositories for mupd8:
Users that are interested in mupd8 are comparing it to the libraries listed below
- Cascading on Apache Flink®☆54Updated last year
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 8 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Docker containers for Druid nodes☆27Updated 8 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 5 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆62Updated last year
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 7 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 8 years ago
- HDFS compatible Distributed Filesystem backed Cassandra☆25Updated 9 years ago
- Llama - Low Latency Application MAster☆34Updated 2 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 8 years ago
- A utility for generating Oozie workflows from a YAML definition☆48Updated 6 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- Apache Beam Site☆29Updated 3 weeks ago
- Periscope brings SLA policy based autoscaling to Hadoop☆35Updated 9 years ago
- A Cascading Workflow Visualizer☆83Updated last year
- CDAP Cube Dataset Guide☆12Updated 7 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 10 years ago
- Read druid segments from hadoop☆10Updated 8 years ago
- Multidimensional data storage with rollups for numerical data☆266Updated last year
- Integration for Cascading and Apache Hive☆26Updated 7 years ago
- Sparking Using Java8☆17Updated 10 years ago
- ☆39Updated 8 years ago
- Experiments with the GDELT dataset and Cassandra schemas.☆25Updated 9 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67Updated 4 years ago
- Apache Spark jobs such as Principal Coordinate Analysis.☆74Updated 8 years ago
- demo clients☆20Updated 7 years ago