geftimov / hadoop-map-reduce-patternsLinks
Hadoop Map-Reduce Design Patterns
☆73Updated 2 years ago
Alternatives and similar repositories for hadoop-map-reduce-patterns
Users that are interested in hadoop-map-reduce-patterns are comparing it to the libraries listed below
Sorting:
- Code repository for O'Reilly Hadoop Application Architectures book☆165Updated 10 years ago
- Source, data and turotials of the blog post video series of Hue, the Web UI for Hadoop.☆236Updated 8 years ago
- Repository for MapReduce Design Patterns (O'Reilly 2012) example source code☆234Updated 10 years ago
- Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive☆287Updated 8 years ago
- Simple Spark Application☆76Updated last year
- Diagrams describing Apache Hadoop internals (2.3.0 or later).☆430Updated 5 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop☆54Updated 10 years ago
- Learning to write Spark examples☆160Updated 10 years ago
- Examples for learning spark☆331Updated 9 years ago
- An implementation of a real-world map-reduce workflow in each major framework.☆151Updated 9 years ago
- Source code to accompany the book "Hadoop in Practice", published by Manning.☆202Updated 5 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆278Updated 6 years ago
- A streaming / online query processing / analytics engine based on Apache Storm☆271Updated 8 years ago
- A simple storm performance/stress test☆74Updated 2 years ago
- ☆55Updated 10 years ago
- Self-written notes that may be useful☆107Updated 9 years ago
- Remedy small files by combining them into larger ones.☆194Updated 3 years ago
- Example programs and scripts for accessing parquet files☆30Updated 7 years ago
- Apache Spark applications☆70Updated 7 years ago
- SequenceIQ Hadoop examples☆115Updated 9 years ago
- Trident-ML : A realtime online machine learning library☆382Updated last year
- Source code that accompanies the book "Hadoop in Practice, Second Edition".☆79Updated 10 years ago
- Oozie Samples☆52Updated 11 years ago
- ☆117Updated 2 years ago
- Large scale query engine benchmark☆99Updated 9 years ago
- Source code for Big Data: Principles and best practices of scalable realtime data systems☆332Updated last year
- spark + drools☆103Updated 3 years ago
- Kafka consumer emitting messages as storm tuples☆103Updated 4 years ago
- Fast and efficient batch computation engine for complex analysis and reporting of massive datasets on Hadoop☆243Updated 9 years ago
- ☆240Updated 3 years ago