jmarkham / yarn-book
Code samples for the book
☆40Updated 11 years ago
Alternatives and similar repositories for yarn-book:
Users that are interested in yarn-book are comparing it to the libraries listed below
- Spark Terasort☆122Updated last year
- Example code for Kudu☆77Updated 6 years ago
- Large scale query engine benchmark☆99Updated 8 years ago
- Mirror of Apache Slider☆78Updated 6 years ago
- Example programs and scripts for accessing parquet files☆30Updated 7 years ago
- An extension of Yahoo's Benchmarks☆107Updated last year
- Mirror of Apache Lens☆60Updated 5 years ago
- Remedy small files by combining them into larger ones.☆193Updated 2 years ago
- Drizzle integration with Apache Spark☆120Updated 6 years ago
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode.☆131Updated last year
- Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an…☆56Updated 7 years ago
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.☆50Updated 8 years ago
- ☆57Updated 5 years ago
- Apache Flink™ training material website☆78Updated 4 years ago
- Code to index Hive tables to Solr and Solr indexes to Hive☆48Updated 5 years ago
- Spark SQL index for Parquet tables☆134Updated 3 years ago
- Hannibal is tool to help monitor and maintain HBase-Clusters that are configured for manual splitting.☆172Updated 7 years ago
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28Updated 10 years ago
- StreamLine - Streaming Analytics☆164Updated last year
- Druid indexing plugin for using Spark in batch jobs☆101Updated 3 years ago
- The SpliceSQL Engine☆168Updated last year
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 8 years ago
- Storm-yarn enables Storm clusters to be deployed into machines managed by Hadoop YARN.☆417Updated last year
- ☆56Updated 4 years ago
- ☆54Updated 10 years ago
- Spark Structured Streaming Kafka 0.8 Source Implementation☆35Updated 7 years ago
- Plugin for Presto to allow addition of user functions easily☆116Updated 3 years ago
- TPC-DS benchmark kit with some modifications/additions☆10Updated 9 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- a set of benchmarks to test Storm performance☆15Updated 8 years ago