ercoppa / HadoopInternals
Diagrams describing Apache Hadoop internals (2.3.0 or later).
☆431Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for HadoopInternals
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.☆552Updated 3 years ago
- ☆245Updated 6 years ago
- The Internals of Spark Structured Streaming☆416Updated last year
- A tool for monitoring and tuning Spark jobs for efficiency.☆357Updated 2 years ago
- Examples for learning spark☆333Updated 9 years ago
- Connect Spark to HBase for reading and writing data with ease☆297Updated 6 years ago
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...☆633Updated 11 months ago
- Benchmark Suite for Apache Spark☆238Updated last year
- Learning to write Spark examples☆160Updated 10 years ago
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces☆321Updated 2 years ago
- hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE☆289Updated last year
- Examples for High Performance Spark☆503Updated 2 weeks ago
- Explore the project Tungsten☆1Updated 8 years ago
- The Internals of Spark SQL☆456Updated this week
- Learning notes of Apache Spark source code☆73Updated 9 years ago
- Real Time Analytics and Data Pipelines based on Spark Streaming☆525Updated 5 years ago
- LinkedIn's previous generation Kafka to HDFS pipeline.☆881Updated 4 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,010Updated 2 years ago
- Kerberos and Hadoop: The Madness beyond the Gate☆277Updated last year
- Spark RDD to read, write and delete from HBase☆276Updated 3 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆280Updated 5 years ago