daplab / yarn-starter
Starter examples to writes distributed fault-tolerant YARN applications
☆9Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for yarn-starter
- Cascading on Apache Flink®☆54Updated 9 months ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 8 years ago
- A HBase schema manager using XML based table definition files.☆68Updated 2 years ago
- Mirror of Apache Slider☆79Updated 5 years ago
- A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.☆76Updated 10 years ago
- Mirror of Apache Spark☆57Updated 9 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 8 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 6 years ago
- Multidimensional data storage with rollups for numerical data☆265Updated 10 months ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase☆51Updated 10 years ago
- Mirror of Apache DirectMemory☆53Updated 11 months ago
- ☆33Updated 8 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Mirror of Apache Sentry☆34Updated 5 years ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 8 years ago
- Recipes and examples for Apache Spark☆13Updated 9 years ago
- A simple storm performance/stress test☆76Updated last year
- Fast JVM collection☆59Updated 9 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 10 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 7 years ago
- XPath likeness for Avro☆35Updated last year
- Flink performance tests☆29Updated 5 years ago