hortonworks/hive-testbench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hortonworks/hive-testbench)

hortonworks / hive-testbench

☆392

Alternatives and similar repositories for hive-testbench

Users that are interested in hive-testbench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

IBM / spark-tpc-ds-performance-test
View on GitHub
Use the TPC-DS benchmark to test Spark SQL performance
☆186Apr 27, 2020Updated 6 years ago
gregrahn / tpcds-kit
View on GitHub
TPC-DS benchmark kit with some modifications/fixes
☆364Apr 16, 2024Updated 2 years ago
Intel-bigdata / HiBench
View on GitHub
HiBench is a big data benchmark suite.
☆1,484Dec 15, 2025Updated 7 months ago
apache / uniffle
View on GitHub
Uniffle is a high performance, general purpose Remote Shuffle Service.
☆452Updated this week
ExpediaGroup / waggle-dance
View on GitHub
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
☆288Jun 25, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
cloudera / impala-tpcds-kit
View on GitHub
TPC-DS Kit for Impala
☆170May 20, 2024Updated 2 years ago
databricks / spark-sql-perf
View on GitHub
☆623Feb 26, 2022Updated 4 years ago
maropu / spark-tpcds-datagen
View on GitHub
All the things about TPC-DS in Apache Spark
☆111Jun 15, 2023Updated 3 years ago
apache / kyuubi
View on GitHub
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,353Updated this week
apache / celeborn
View on GitHub
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
☆1,059Updated this week
kcheeeung / hive-benchmark
View on GitHub
Automated TPC-DS and TPC-H benchmark for Apache Hive LLAP
☆10Jul 18, 2022Updated 4 years ago
linkedin / coral
View on GitHub
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
☆907Updated this week
marcelmay / hfsa
View on GitHub
Hadoop FSImage Analyzer (HFSA)
☆68Jun 24, 2026Updated last month
linkedin / dr-elephant
View on GitHub
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
☆1,370Aug 22, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
oap-project / Gluten-Trino
View on GitHub
Gluten: Plugin to Boost Trino's Performance
☆75Oct 25, 2023Updated 2 years ago
apache / gluten
View on GitHub
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
☆1,577Updated this week
trinodb / tpcds
View on GitHub
Port of TPC-DS dsdgen to Java
☆22Jun 23, 2026Updated last month
Kyligence / kylin-tpch
View on GitHub
Run TPCH Benchmark on Apache Kylin
☆22Jan 24, 2022Updated 4 years ago
apache / amoro
View on GitHub
Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.
☆1,152Updated this week
databricks / tpcds-kit
View on GitHub
TPC-DS benchmark kit with some modifications/fixes
☆107Aug 13, 2024Updated last year
Intel-bigdata / SSM
View on GitHub
Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution
☆139Jan 3, 2023Updated 3 years ago
ververica / flink-sql-benchmark
View on GitHub
☆106Jan 12, 2026Updated 6 months ago
uber / RemoteShuffleService
View on GitHub
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
☆335Sep 29, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
apache / ranger
View on GitHub
Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond
☆1,068Updated this week
fayson / cdhproject
View on GitHub
hadoop各组件使用，持续更新
☆897Jan 4, 2023Updated 3 years ago
Kyligence / ssb-kylin
View on GitHub
Star Schema Benchmark Tool for Apache Kylin
☆96Aug 26, 2021Updated 4 years ago
oap-project / gazelle_plugin
View on GitHub
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
☆255Feb 21, 2023Updated 3 years ago
trinodb / benchto
View on GitHub
Framework for running macro benchmarks in a clustered environment
☆39Mar 5, 2025Updated last year
cubefs / compass
View on GitHub
Compass is a task diagnosis platform for bigdata
☆404Nov 23, 2024Updated last year
allwefantasy / sql-code-intelligence
View on GitHub
sql code autocomplete
☆45Sep 2, 2020Updated 5 years ago
apache / paimon-trino
View on GitHub
Trino Connector for Apache Paimon.
☆44May 15, 2026Updated 2 months ago
linkedin / transport
View on GitHub
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…
☆306Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
apache / carbondata
View on GitHub
High performance data store solution
☆1,448Jul 4, 2026Updated 3 weeks ago
yahoojapan / presto_exporter
View on GitHub
☆33Mar 30, 2021Updated 5 years ago
cartershanklin / hive-testbench
View on GitHub
Testbench for experimenting with Apache Hive at any data scale.
☆64Jul 10, 2017Updated 9 years ago
DTStack / chunjun
View on GitHub
A data integration framework
☆4,107Dec 2, 2025Updated 7 months ago
bluishglc / emr-edgenode-maker
View on GitHub
This tool can easily make / build an emr cluster edge node / client node / gateway node
☆10Jun 1, 2022Updated 4 years ago
Tencent / Firestorm
View on GitHub
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…
☆256Apr 7, 2023Updated 3 years ago
mr3project / hive-mr3
View on GitHub
Hive for MR3
☆39Updated this week