Intel-bigdata/HiBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Intel-bigdata/HiBench)

Intel-bigdata / HiBench

HiBench is a big data benchmark suite.

☆1,485

Alternatives and similar repositories for HiBench

Users that are interested in HiBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zhihuili / Dew
View on GitHub
☆116Jul 6, 2015Updated 11 years ago
CODAIT / spark-bench
View on GitHub
Benchmark Suite for Apache Spark
☆242Apr 12, 2023Updated 3 years ago
yahoo / streaming-benchmarks
View on GitHub
Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...
☆647Dec 17, 2023Updated 2 years ago
databricks / spark-sql-perf
View on GitHub
☆623Feb 26, 2022Updated 4 years ago
databricks / spark-perf
View on GitHub
Performance tests for Apache Spark
☆392Jul 9, 2018Updated 8 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
asonje / PAT
View on GitHub
Performance Analysis Tool
☆78Nov 25, 2025Updated 7 months ago
hortonworks / hive-testbench
View on GitHub
☆392Jan 25, 2024Updated 2 years ago
linkedin / dr-elephant
View on GitHub
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
☆1,370Aug 22, 2023Updated 2 years ago
brianfrankcooper / YCSB
View on GitHub
Yahoo! Cloud Serving Benchmark
☆5,226Apr 15, 2026Updated 3 months ago
IBM / spark-tpc-ds-performance-test
View on GitHub
Use the TPC-DS benchmark to test Spark SQL performance
☆186Apr 27, 2020Updated 6 years ago
ehiggs / spark-terasort
View on GitHub
Spark Terasort
☆121Apr 21, 2023Updated 3 years ago
liubin2048 / HiBench-8.0
View on GitHub
The upgrade was based on the HiBench7.0 release
☆11Oct 13, 2020Updated 5 years ago
apache / uniffle
View on GitHub
Uniffle is a high performance, general purpose Remote Shuffle Service.
☆451Updated this week
Alluxio / alluxio
View on GitHub
Alluxio, data orchestration for analytics and machine learning in the cloud
☆7,214Apr 29, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dataArtisans / yahoo-streaming-benchmark
View on GitHub
An extension of Yahoo's Benchmarks
☆108Dec 18, 2023Updated 2 years ago
JerryLead / SparkInternals
View on GitHub
Notes talking about the design and implementation of Apache Spark
☆5,361Apr 2, 2024Updated 2 years ago
apache / celeborn
View on GitHub
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
☆1,056Updated this week
apache / kyuubi
View on GitHub
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,353Updated this week
itisaid / Doris
View on GitHub
A large scale distributed KV storage system.
☆237May 31, 2017Updated 9 years ago
apache / calcite
View on GitHub
Apache Calcite
☆5,159Updated this week
apache / kylin
View on GitHub
Apache Kylin
☆3,769Updated this week
apache / incubator-crail
View on GitHub
Mirror of Apache crail (Incubating)
☆152Jul 3, 2022Updated 4 years ago
apache / spark
View on GitHub
Apache Spark - A unified analytics engine for large-scale data processing
☆43,663Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
apache / flink
View on GitHub
Apache Flink
☆26,199Updated this week
Mellanox / SparkRDMA
View on GitHub
This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid…
☆257May 13, 2019Updated 7 years ago
apache / carbondata
View on GitHub
High performance data store solution
☆1,448Jul 4, 2026Updated 2 weeks ago
apache / ambari
View on GitHub
Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters.
☆2,306Updated this week
spark-jobserver / spark-jobserver
View on GitHub
REST job server for Apache Spark
☆2,837Mar 3, 2026Updated 4 months ago
apache / gobblin
View on GitHub
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…
☆2,270Jun 24, 2026Updated 3 weeks ago
Tencent / Firestorm
View on GitHub
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…
☆256Apr 7, 2023Updated 3 years ago
Intel-bigdata / Spark-PMoF
View on GitHub
Spark Shuffle Optimization with RDMA+AEP
☆30May 23, 2023Updated 3 years ago
prestodb / presto
View on GitHub
The official home of the Presto distributed SQL query engine for big data
☆16,719Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
byzer-org / byzer-lang
View on GitHub
Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.
☆1,835May 29, 2024Updated 2 years ago
apache / hive
View on GitHub
Apache Hive
☆5,995Updated this week
Intel-bigdata / SSM
View on GitHub
Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution
☆139Jan 3, 2023Updated 3 years ago
apache / gluten
View on GitHub
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
☆1,576Updated this week
cloudera / impala-tpcds-kit
View on GitHub
TPC-DS Kit for Impala
☆170May 20, 2024Updated 2 years ago
DTStack / flinkStreamSQL
View on GitHub
基于开源的flink，对其实时sql进行扩展；主要实现了流与维表的join，支持原生flink SQL所有的语法
☆2,052Feb 21, 2024Updated 2 years ago
nexmark / nexmark
View on GitHub
Benchmarks for queries over continuous data streams.
☆387Dec 26, 2025Updated 6 months ago