IBM/spark-tpc-ds-performance-test

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/IBM/spark-tpc-ds-performance-test)

IBM / spark-tpc-ds-performance-test

Use the TPC-DS benchmark to test Spark SQL performance

☆186

Alternatives and similar repositories for spark-tpc-ds-performance-test

Users that are interested in spark-tpc-ds-performance-test are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

databricks / spark-sql-perf
View on GitHub
☆623Feb 26, 2022Updated 4 years ago
databricks / tpcds-kit
View on GitHub
TPC-DS benchmark kit with some modifications/fixes
☆107Aug 13, 2024Updated last year
gregrahn / tpcds-kit
View on GitHub
TPC-DS benchmark kit with some modifications/fixes
☆364Apr 16, 2024Updated 2 years ago
maropu / spark-tpcds-datagen
View on GitHub
All the things about TPC-DS in Apache Spark
☆111Jun 15, 2023Updated 3 years ago
ssavvides / tpch-spark
View on GitHub
TPC-H queries in Apache Spark SQL using native DataFrames API
☆99Jan 24, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
databricks / tpch-dbgen
View on GitHub
Patched version of dbgen
☆34Feb 25, 2024Updated 2 years ago
hortonworks / hive-testbench
View on GitHub
☆392Jan 25, 2024Updated 2 years ago
yaooqinn / tpcds-for-spark
View on GitHub
☆23May 12, 2018Updated 8 years ago
sunileman / MapReduce-Performance_Testing
View on GitHub
MapReduce performance testing using teragen and terasort
☆19Aug 26, 2021Updated 4 years ago
yuananf / tpcds-presto
View on GitHub
tpcds queries for presto
☆13Oct 18, 2016Updated 9 years ago
ehiggs / spark-terasort
View on GitHub
Spark Terasort
☆121Apr 21, 2023Updated 3 years ago
CODAIT / spark-bench
View on GitHub
Benchmark Suite for Apache Spark
☆242Apr 12, 2023Updated 3 years ago
Teradata / tpcds
View on GitHub
Port of TPC-DS dsdgen to Java
☆50Aug 5, 2024Updated last year
Intel-bigdata / HiBench
View on GitHub
HiBench is a big data benchmark suite.
☆1,484Dec 15, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
oap-project / sql-ds-cache
View on GitHub
Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.
☆37Jan 3, 2023Updated 3 years ago
yaooqinn / spark-authorizer
View on GitHub
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…
☆183Apr 6, 2022Updated 4 years ago
squito / spark-memory
View on GitHub
A tool to get better debug info on spark's memory usage
☆42Aug 21, 2019Updated 6 years ago
agirish / tpcds
View on GitHub
TPC-DS queries
☆65Jun 17, 2015Updated 11 years ago
databricks / spark-perf
View on GitHub
Performance tests for Apache Spark
☆392Jul 9, 2018Updated 8 years ago
Mellanox / SparkRDMA
View on GitHub
This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid…
☆258May 13, 2019Updated 7 years ago
liancheng / brainsuck
View on GitHub
A simple optimizing Brainfuck compiler (used as the demo for my QCon Beijing 2015 talk)
☆61Sep 23, 2022Updated 3 years ago
kcheeeung / hive-benchmark
View on GitHub
Automated TPC-DS and TPC-H benchmark for Apache Hive LLAP
☆10Jul 18, 2022Updated 4 years ago
hvanhovell / weld-java
View on GitHub
JVM integration for Weld
☆16Sep 24, 2018Updated 7 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
flink-tpc-ds / flink-community-perf
View on GitHub
TPC-DS Performance tests tool for Flink
☆29May 21, 2021Updated 5 years ago
JerryLead / SparkProfiler
View on GitHub
Profiling Spark Applications for Performance Comparison and Diagnosis
☆16Nov 11, 2018Updated 7 years ago
apache / gluten
View on GitHub
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
☆1,577Updated this week
linkedin / transport
View on GitHub
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…
☆306Updated this week
cerndb / SparkPlugins
View on GitHub
Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…
☆96May 11, 2026Updated 2 months ago
BBVA / spark-benchmarks
View on GitHub
Benchmarking suite for Apache Spark
☆16Nov 24, 2017Updated 8 years ago
multifacet / cbmm-artifact
View on GitHub
Artifact package for CBMM paper (ATC'22)
☆11Jun 5, 2022Updated 4 years ago
apache / incubator-crail
View on GitHub
Mirror of Apache crail (Incubating)
☆152Jul 3, 2022Updated 4 years ago
uber / RemoteShuffleService
View on GitHub
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
☆335Sep 29, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
zrlio / parquet-generator
View on GitHub
Parquet file generator
☆22Apr 17, 2018Updated 8 years ago
databricks / benchmarks
View on GitHub
A place in which we publish scripts for reproducible benchmarks.
☆105Dec 13, 2019Updated 6 years ago
liancheng / spear
View on GitHub
A playground for experimenting ideas that may apply to Spark SQL/Catalyst
☆143Jul 5, 2018Updated 8 years ago
apache / kyuubi
View on GitHub
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,353Updated this week
qubole / spark-state-store
View on GitHub
Rocksdb state storage implementation for Structured Streaming.
☆17Oct 21, 2020Updated 5 years ago
hammerlab / grafana-spark-dashboards
View on GitHub
Scripts for generating Grafana dashboards for monitoring Spark jobs
☆240Mar 26, 2015Updated 11 years ago
sam1016yu / DB-Exp-Sensitivity
View on GitHub
A Study of Database Performance Sensitivity to Experiment Settings
☆11May 31, 2022Updated 4 years ago