TPC-H queries in Apache Spark SQL using native DataFrames API
☆98Jan 24, 2024Updated 2 years ago
Alternatives and similar repositories for tpch-spark
Users that are interested in tpch-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for TPCx-BB benchmark for Hive and SparkSQL on scale factor of 300 GB☆10Jun 26, 2018Updated 7 years ago
- Use the TPC-DS benchmark to test Spark SQL performance☆184Apr 27, 2020Updated 5 years ago
- JVM integration for Weld☆16Sep 24, 2018Updated 7 years ago
- A Memory-Disaggregated Managed Runtime.☆67Aug 28, 2021Updated 4 years ago
- Running TPC-H on Apache Hive☆41Jul 15, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Examples of all Machine Learning Algorithm in Apache Spark☆15Nov 2, 2017Updated 8 years ago
- TPC-DS benchmark kit with some modifications/additions☆10Nov 12, 2015Updated 10 years ago
- ☆23May 2, 2024Updated last year
- Scala Mison implementation☆15Nov 16, 2018Updated 7 years ago
- ☆19Mar 24, 2018Updated 8 years ago
- ☆17Oct 24, 2018Updated 7 years ago
- Prefetching and efficient data path for memory disaggregation☆69Jul 16, 2020Updated 5 years ago
- Midas is a memory management system that efficiently and safely harvests idle memory for applications' soft state.☆11Oct 30, 2024Updated last year
- 分类模型☆15Apr 19, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Example of reading/writing Excel files from Pandas/Python☆14Dec 10, 2014Updated 11 years ago
- Simulation infrastructure and validation of Cori☆13Mar 22, 2022Updated 4 years ago
- Port of TPC-DS data generator to Java☆13Aug 1, 2017Updated 8 years ago
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...☆645Dec 17, 2023Updated 2 years ago
- Apache Spark TPC-DS benchmark setup with EMR launch setup☆17Jul 11, 2022Updated 3 years ago
- Testbench for experimenting with Apache Hive at any data scale.☆64Jul 10, 2017Updated 8 years ago
- The dbt-spark-livy adapter allows you to use dbt along with Apache Spark, by connecting via Apache Livy☆12Mar 30, 2023Updated 3 years ago
- Scalable Distributed LDA implementation for Spark & Glint☆29Sep 27, 2016Updated 9 years ago
- TPC-DS benchmarks including data generation with Spark and queries with Spark☆15May 8, 2017Updated 8 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Parquet file generator☆22Apr 17, 2018Updated 7 years ago
- Fastswap, a fast swap system for far memory through RDMA☆85Nov 19, 2023Updated 2 years ago
- ☆21Apr 17, 2024Updated last year
- TPC-H benchmark, specific for mysql☆25Apr 18, 2013Updated 12 years ago
- Large scale query engine benchmark☆99Apr 5, 2016Updated 10 years ago
- tpcds queries for presto☆13Oct 18, 2016Updated 9 years ago
- tpch-dbgen☆38Jun 24, 2012Updated 13 years ago
- Scheduler scoreboard is a single toolkit to capture and report all the data related to the Linux Kernel Scheduler which can help analyze …☆16Feb 14, 2025Updated last year
- SnailTrail implementation☆40Apr 12, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Benchmark Suite for Apache Spark☆240Apr 12, 2023Updated 2 years ago
- HiBench is a big data benchmark suite.☆1,489Dec 15, 2025Updated 3 months ago
- PostgreSQL-compatible TPC-H benchmark, with wrapper scripts for populating data and evaluating performance☆30Jun 28, 2016Updated 9 years ago
- TPC-DS Generation, Execution and Analyzer for Postgres☆20Dec 6, 2022Updated 3 years ago
- Pond: CXL-Based Memory Pooling Systems for Cloud Platforms (ASPLOS'23)☆221Oct 13, 2024Updated last year
- TPC-DS queries☆66Jun 17, 2015Updated 10 years ago
- Canvas: Isolated and Adaptive Swapping for Multi-Applications on Remote Memory☆38Apr 19, 2023Updated 2 years ago