TPC-DS Kit for Impala
☆170 · May 20, 2024 · Updated last year
Alternatives and similar repositories for impala-tpcds-kit
Users that are interested in impala-tpcds-kit are comparing it to the libraries listed below
- ☆393 · Jan 25, 2024 · Updated 2 years ago
- TPC-DS benchmark kit with some modifications/fixes ☆357 · Apr 16, 2024 · Updated last year
- Testbench for experimenting with Apache Hive at any data scale. ☆64 · Jul 10, 2017 · Updated 8 years ago
- Tools to deploy Hadoop on EMC Isilon ☆17 · Jul 27, 2016 · Updated 9 years ago
- Llama - Low Latency Application MAster ☆35 · Jun 27, 2022 · Updated 3 years ago
- ☆10 · Jun 3, 2023 · Updated 2 years ago
- All the things about TPC-DS in Apache Spark ☆109 · Jun 15, 2023 · Updated 2 years ago
- This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media) ☆22 · Aug 20, 2017 · Updated 8 years ago
- ☆16 · Nov 8, 2015 · Updated 10 years ago
- Real-time Query for Hadoop; mirror of Apache Impala ☆34 · Dec 27, 2022 · Updated 3 years ago
- Sample UDF and UDAs for Impala. ☆63 · Sep 19, 2025 · Updated 5 months ago
- Running TPC-H on Apache Hive ☆41 · Jul 15, 2019 · Updated 6 years ago
- Generate big TPC-DS datasets with Databricks ☆21 · Jan 3, 2022 · Updated 4 years ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,371 · Aug 22, 2023 · Updated 2 years ago
- Greenplum TPC-DS benchmark ☆116 · Jul 3, 2023 · Updated 2 years ago
- Apache Impala ☆1,267 · Updated this week
- Shaded version of Apache Hadoop 2.x for Presto ☆16 · Sep 16, 2025 · Updated 5 months ago
- ☆49 · Apr 12, 2022 · Updated 3 years ago
- HiBench is a big data benchmark suite. ☆1,489 · Dec 15, 2025 · Updated 2 months ago
- Source code for TPCx-BB benchmark for Hive and SparkSQL on scale factor of 300 GB ☆10 · Jun 26, 2018 · Updated 7 years ago
- DocGenius AI - Generative AI Chatbot for your Documents ☆14 · Aug 14, 2025 · Updated 6 months ago
- Fusiondb is a simple and powerful federated database engine ☆11 · Sep 8, 2022 · Updated 3 years ago
- High performance data store solution ☆1,446 · Updated this week
- Use the TPC-DS benchmark to test Spark SQL performance ☆184 · Apr 27, 2020 · Updated 5 years ago
- This project keeps the patches that were submitted to the open-source community ☆16 · Oct 22, 2017 · Updated 8 years ago
- Tools for Using Hadoop with OneFS ☆15 · Aug 7, 2023 · Updated 2 years ago
- Showing the relationship between ImageNet ID and labels and pytorch pre-trained model output ID and labels ☆10 · Oct 11, 2020 · Updated 5 years ago
- Read druid segments from hadoop ☆10 · Jan 18, 2017 · Updated 9 years ago
- Utilities to help HBase as a service in HDInsight Azure ☆14 · Aug 30, 2023 · Updated 2 years ago
- Cloudera Manager Dashboards for Hadoop Administrators ☆29 · Nov 12, 2015 · Updated 10 years ago
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode. ☆134 · Jan 11, 2024 · Updated 2 years ago
- Classification models ☆15 · Apr 19, 2018 · Updated 7 years ago
- Run TPC-DS against different databases including Hive, Spark SQL and IBM BigSQL ☆14 · Jan 4, 2022 · Updated 4 years ago
- TPC-DS benchmarks including data generation with Spark and queries with Spark ☆14 · May 8, 2017 · Updated 8 years ago
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads ☆16 · Feb 18, 2026 · Updated 2 weeks ago
- A platform to manage the data product life cycle ☆22 · Feb 11, 2026 · Updated 3 weeks ago
- DataBright: Towards a Global Exchange for Decentralized Data Ownership and Trusted Computation ☆13 · Jun 28, 2018 · Updated 7 years ago
- TPC-DS benchmark kit with some modifications/additions ☆10 · Nov 12, 2015 · Updated 10 years ago
- Port of TPC-DS dsdgen to Java ☆50 · Aug 5, 2024 · Updated last year