DataTorrent / Apex-old
☆37Updated this week
Related projects: ⓘ
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 8 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Cascading on Apache Flink®☆54Updated 7 months ago
- ☆12Updated this week
- ☆28Updated this week
- ☆29Updated this week
- ☆15Updated this week
- ☆27Updated this week
- Hadoop mapreduce job to bulk load data into Cassandra☆75Updated 2 years ago
- A Cascading Workflow Visualizer☆83Updated last year
- Fork of Cloudera Impala separated from Hadoop☆42Updated 8 years ago
- Simplify getting Zeppelin up and running☆56Updated 8 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 7 years ago
- ☆57Updated this week
- A utility for generating Oozie workflows from a YAML definition☆48Updated 5 years ago
- ☆26Updated this week
- Experiments with the GDELT dataset and Cassandra schemas.☆25Updated 8 years ago
- Automates Spark standalone cluster tasks with Puppet and Fabric.☆43Updated 10 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 6 years ago
- ☆18Updated this week
- IoT - It's the thing you want! And so here's a full-stack demo.☆62Updated 8 years ago
- something to help you spark☆65Updated 5 years ago
- ☆52Updated this week
- ☆110Updated 7 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 10 years ago
- Pig on Apache Spark☆83Updated 9 years ago
- Starter examples to writes distributed fault-tolerant YARN applications☆9Updated 9 years ago
- Muppet☆126Updated 3 years ago