JonathanMace / tpcds
TPC-DS benchmarks including data generation with Spark and queries with Spark
☆14 Updated 8 years ago
Alternatives and similar repositories for tpcds
Users who are interested in tpcds are comparing it to the libraries listed below
- JVM integration for Weld ☆16 Updated 7 years ago
- An extension of Yahoo's Benchmarks ☆108 Updated 2 years ago
- Spark Terasort ☆121 Updated 2 years ago
- Mirror of Apache crail (Incubating) ☆150 Updated 3 years ago
- Drizzle integration with Apache Spark ☆120 Updated 7 years ago
- Spark Shuffle Optimization with RDMA+AEP ☆30 Updated 2 years ago
- Quark is a data virtualization engine over analytic databases. ☆100 Updated 8 years ago
- Performance Analysis Tool ☆78 Updated 3 weeks ago
- Testbench for experimenting with Apache Hive at any data scale. ☆64 Updated 8 years ago
- An experimental Graph Streaming API for Apache Flink ☆141 Updated 5 years ago
- Cache File System optimized for columnar formats and object stores ☆187 Updated 3 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange ☆130 Updated last year
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka ☆29 Updated 9 years ago
- Fast I/O plugins for Spark ☆41 Updated 5 years ago
- Enabling queries on compressed data. ☆281 Updated 2 years ago
- This repository provides Scotty, a framework for efficient window aggregations for out-of-order Stream Processing. ☆79 Updated 2 years ago
- Benchmark Suite for Apache Spark ☆241 Updated 2 years ago
- Provides GPU awareness to Spark, Contact: @kmadhugit and @kiszk ☆171 Updated 7 years ago
- Use the TPC-DS benchmark to test Spark SQL performance ☆183 Updated 5 years ago
- Scripts to analyze Spark's performance