TPC-H queries in Apache Spark SQL using the native DataFrame API
☆98 Jan 24, 2024 Updated 2 years ago
Alternatives and similar repositories for tpch-spark
Users that are interested in tpch-spark are comparing it to the libraries listed below
- Source code for TPCx-BB benchmark for Hive and SparkSQL on scale factor of 300 GB ☆10 Jun 26, 2018 Updated 7 years ago
- Use the TPC-DS benchmark to test Spark SQL performance ☆184 Apr 27, 2020 Updated 5 years ago
- A Memory-Disaggregated Managed Runtime. ☆67 Aug 28, 2021 Updated 4 years ago
- Running TPC-H on Apache Hive ☆41 Jul 15, 2019 Updated 6 years ago
- Examples of all Machine Learning Algorithm in Apache Spark ☆15 Nov 2, 2017 Updated 8 years ago
- TPC-DS benchmark kit with some modifications/additions ☆10 Nov 12, 2015 Updated 10 years ago
- ☆23 May 2, 2024 Updated last year
- ☆10 Aug 28, 2018 Updated 7 years ago
- Scala Mison implementation ☆15 Nov 16, 2018 Updated 7 years ago
- ☆19 Mar 24, 2018 Updated 7 years ago
- ☆17 Oct 24, 2018 Updated 7 years ago
- Prefetching and efficient data path for memory disaggregation ☆69 Jul 16, 2020 Updated 5 years ago
- Midas is a memory management system that efficiently and safely harvests idle memory for applications' soft state. ☆11 Oct 30, 2024 Updated last year
- Classification models ☆15 Apr 19, 2018 Updated 7 years ago
- ☆14 Aug 23, 2015 Updated 10 years ago
- Example of reading/writing Excel files from Pandas/Python ☆14 Dec 10, 2014 Updated 11 years ago
- Simulation infrastructure and validation of Cori ☆13 Mar 22, 2022 Updated 3 years ago
- Port of TPC-DS data generator to Java ☆13 Aug 1, 2017 Updated 8 years ago
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ... ☆646 Dec 17, 2023 Updated 2 years ago
- Apache Spark TPC-DS benchmark setup with EMR launch setup ☆17 Jul 11, 2022 Updated 3 years ago
- A GPU Cluster Simulator for Distributed Deep Learning Training. ☆11 Jan 15, 2022 Updated 4 years ago
- Scalable Distributed LDA implementation for Spark & Glint ☆29 Sep 27, 2016 Updated 9 years ago
- TPC-DS benchmarks including data generation with Spark and queries with Spark ☆14 May 8, 2017 Updated 8 years ago
- Parquet file generator ☆22 Apr 17, 2018 Updated 7 years ago
- ☆16 May 9, 2018 Updated 7 years ago
- Source code for 'Practical Graph Analytics with Apache Giraph' by Roman Shaposhnik, Claudio Martella, and Dionysios Logothetis ☆12 Mar 28, 2017 Updated 8 years ago
- ☆21 Apr 17, 2024 Updated last year
- ☆131 Jan 10, 2019 Updated 7 years ago
- TPC-H benchmark, specific for mysql ☆25 Apr 18, 2013 Updated 12 years ago
- Large scale query engine benchmark ☆99 Apr 5, 2016 Updated 9 years ago
- tpch-dbgen ☆38 Jun 24, 2012 Updated 13 years ago
- Factorized Incremental View Maintenance for Queries and Analytics ☆22 Dec 17, 2025 Updated 3 months ago
- SnailTrail implementation ☆40 Apr 12, 2019 Updated 6 years ago
- Benchmark Suite for Apache Spark ☆240 Apr 12, 2023 Updated 2 years ago
- HiBench is a big data benchmark suite. ☆1,491 Dec 15, 2025 Updated 3 months ago
- TPC-DS Generation, Execution and Analyzer for Postgres ☆20 Dec 6, 2022 Updated 3 years ago
- blah ☆35 May 5, 2019 Updated 6 years ago
- Rust cloud object storage tools ☆12 Aug 9, 2021 Updated 4 years ago
- TPC-DS queries ☆65 Jun 17, 2015 Updated 10 years ago