valuko / TPCx-BB
Source code for TPCx-BB benchmark for Hive and SparkSQL on scale factor of 300 GB
☆10Updated 6 years ago
Alternatives and similar repositories for TPCx-BB:
Users that are interested in TPCx-BB are comparing it to the libraries listed below
- TPC-H queries in Apache Spark SQL using native DataFrames API☆99Updated last year
- tpch-dbgen☆38Updated 12 years ago
- DS2 is an auto-scaling controller for distributed streaming dataflows☆89Updated 2 years ago
- Mirror of Apache crail (Incubating)☆150Updated 2 years ago
- Snowflake dataset containing statistics for 70 million queries over 14 day period☆112Updated 3 years ago
- Performance Analysis Tool☆76Updated 2 years ago
- A Benchmark Harness for Systematic and Robust Evaluation of Streaming State Stores☆17Updated last year
- Spark Shuffle Optimization with RDMA+AEP☆30Updated last year
- Naos: Serialization-free RDMA networking in Java☆14Updated 3 years ago
- Use the TPC-DS benchmark to test Spark SQL performance☆179Updated 4 years ago
- Optimizing data-intensive systems in disaggregated data centers☆13Updated 2 years ago
- GPU library for writing SQL queries☆73Updated 10 months ago
- All the things about TPC-DS in Apache Spark☆105Updated last year
- Spark Terasort☆122Updated 2 years ago
- ☆16Updated last year
- TPC-DS queries☆60Updated 9 years ago
- Lakehouse storage system benchmark☆73Updated 2 years ago
- ☆20Updated 4 years ago
- Benchmark suite to evaluate HTAP database engines☆22Updated 2 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆127Updated 4 months ago
- Spark* Shuffle plugin for support shuffling through remote persistent memory over fabrics, which leverages the RDMA network and remote pe…☆14Updated last year
- Radix Hash Join - SIGMOD Contest 2018.☆33Updated 6 years ago
- This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid…☆244Updated 5 years ago
- A modular acceleration toolkit for big data analytic engines☆68Updated 11 months ago
- Virtual Memory Abstraction for Serverless Architectures☆47Updated 3 years ago
- A Multicore, NUMA Optimised Data Stream Processing System☆38Updated 2 years ago
- Reducing the cache misses of SIMD vectorization using IMV☆28Updated 2 years ago
- stream processing reading list☆68Updated last year
- How to plot for papers, slides, demos, etc.☆10Updated 3 years ago
- This repository contains the code base for the Open Stream Processing Benchmark.☆50Updated 3 years ago