jmarkham / yarn-book
Code samples for the book
☆40 Updated 11 years ago
Alternatives and similar repositories for yarn-book:
Users who are interested in yarn-book are comparing it to the repositories listed below.
- Large scale query engine benchmark ☆99 Updated 8 years ago
- Spark Terasort ☆122 Updated last year
- Example code for Kudu ☆77 Updated 6 years ago
- Mirror of Apache Slider ☆78 Updated 6 years ago
- Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an… ☆56 Updated 7 years ago
- An extension of Yahoo's Benchmarks ☆107 Updated last year
- TPC-DS Kit for Impala ☆171 Updated 10 months ago
- Drizzle integration with Apache Spark ☆120 Updated 6 years ago
- Quark is a data virtualization engine over analytic databases. ☆98 Updated 7 years ago
- Example programs and scripts for accessing parquet files ☆30 Updated 7 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange ☆127 Updated 3 months ago
- Apache Flink™ training material website ☆78 Updated 4 years ago
- Druid indexing plugin for using Spark in batch jobs ☆101 Updated 3 years ago
- Spark SQL index for Parquet tables ☆134 Updated 3 years ago
- ☆57 Updated 6 years ago
- Spark Shuffle Optimization with RDMA+AEP ☆30 Updated last year
- Example of use of Spark Streaming with Kafka ☆90 Updated 10 years ago
- Mirror of Apache Spark ☆57 Updated 9 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark ☆51 Updated 11 years ago
- spark summit 2017 SanFrancisco ☆97 Updated 7 years ago
- Apache Calcite Tutorial ☆33 Updated 8 years ago
- Benchmark Suite for Apache Spark ☆239 Updated last year
- Remedy small files by combining them into larger ones. ☆193 Updated 2 years ago
- Mirror of Apache Lens ☆60 Updated 5 years ago
- Testbench for experimenting with Apache Hive at any data scale. ☆64 Updated 7 years ago
- Spark Structured Streaming Kafka 0.8 Source Implementation ☆35 Updated 7 years ago
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode. ☆131 Updated last year
- Mirror of Apache Crunch (Incubating) ☆104 Updated 4 years ago
- Learning to write Spark examples ☆160 Updated 10 years ago
- ☆56 Updated 4 years ago