microsoft / Peregrine
Peregrine is a workload optimization platform for cloud query engines. The goal of Peregrine is three-fold: 1. make it easier to ingest and analyze query workload telemetry into a common engine-agnostic representation, 2. help developers to quickly build workload optimization applications to reduce overall costs and improve operational efficien…
☆22Updated 4 years ago
Related projects: ⓘ
- The DSB benchmark is designed for evaluating both workloaddriven and traditional database systems on modern decision support workloads. D…☆42Updated last year
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆64Updated last week
- Microsoft's contributions for Spark with Apache Accumulo☆19Updated 3 years ago
- TPC-DS benchmark kit with some modifications/fixes☆85Updated last month
- Self regulation and auto-tuning for distributed system☆64Updated last year
- Lakehouse storage system benchmark☆64Updated last year
- ☆19Updated 2 months ago
- Mirror of Apache crail (Incubating)☆147Updated 2 years ago
- SQL-ProcBench is an open benchmark for procedural workloads in RDBMSs.☆42Updated 2 years ago
- MLOS is a project to enable autotuning for systems.☆136Updated this week
- TPC-H queries in Apache Spark SQL using native DataFrames API☆97Updated 7 months ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆423Updated 2 years ago
- TPC-DS queries☆54Updated 9 years ago
- Hadoop utility jar for troubleshooting integration with cloud object stores☆33Updated this week
- Parquet file generator☆22Updated 6 years ago
- TPC-DS benchmark kit with some modifications/additions☆10Updated 8 years ago
- tpch-dbgen☆32Updated 12 years ago
- ☆75Updated 2 weeks ago
- Apache Spark - A unified analytics engine for large-scale data processing☆15Updated last year
- Cache File System optimized for columnar formats and object stores☆182Updated 2 years ago
- TPC-H dbgen☆279Updated last year
- Performance Analysis Tool☆76Updated last year
- Snowflake dataset containing statistics for 70 million queries over 14 day period☆101Updated 2 years ago
- BI benchmark with user generated data and queries☆62Updated 5 years ago
- ☆31Updated 3 months ago
- A modular acceleration toolkit for big data analytic engines☆67Updated 4 months ago
- ☆104Updated last year
- Trisk on Flink☆17Updated 2 years ago
- A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.☆19Updated this week
- Rockset community content☆16Updated 6 months ago