microsoft / Peregrine
Peregrine is a workload optimization platform for cloud query engines. The goal of Peregrine is three-fold: 1. make it easier to ingest and analyze query workload telemetry into a common engine-agnostic representation, 2. help developers to quickly build workload optimization applications to reduce overall costs and improve operational efficien…
☆22Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for Peregrine
- The DSB benchmark is designed for evaluating both workloaddriven and traditional database systems on modern decision support workloads. D…☆47Updated 2 weeks ago
- Self regulation and auto-tuning for distributed system☆64Updated last year
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆69Updated this week
- Lakehouse storage system benchmark☆66Updated last year
- tpch-dbgen☆35Updated 12 years ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆424Updated 2 years ago
- Mirror of Apache crail (Incubating)☆148Updated 2 years ago
- SQL-ProcBench is an open benchmark for procedural workloads in RDBMSs.☆43Updated 3 years ago
- BI benchmark with user generated data and queries☆64Updated 5 years ago
- TPCDS benchmark for various engines☆18Updated 2 years ago
- Spark Terasort☆123Updated last year
- FishStore is a prototype fast ingestion and querying layer for flexible-schema data☆214Updated last year
- Snowflake dataset containing statistics for 70 million queries over 14 day period☆103Updated 3 years ago
- Microsoft's contributions for Spark with Apache Accumulo☆19Updated 4 years ago
- A modular acceleration toolkit for big data analytic engines☆67Updated 6 months ago
- ☆36Updated last year
- Fast I/O plugins for Spark☆41Updated 3 years ago
- [Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O☆71Updated 6 years ago
- MLOS is a project to enable autotuning for systems.☆141Updated this week
- TPC-H dbgen☆286Updated last year
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆127Updated last month
- TPC-DS queries☆56Updated 9 years ago
- Code for Ernest☆32Updated last year
- A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.☆21Updated this week
- AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure☆150Updated 3 years ago
- TPC-DS benchmark kit with some modifications/additions☆10Updated 9 years ago
- ☆77Updated this week
- ☆104Updated last year