microsoft / PeregrineLinks
Peregrine is a workload optimization platform for cloud query engines. The goal of Peregrine is three-fold: 1. make it easier to ingest and analyze query workload telemetry into a common engine-agnostic representation, 2. help developers to quickly build workload optimization applications to reduce overall costs and improve operational efficien…
☆22Updated 4 years ago
Alternatives and similar repositories for Peregrine
Users that are interested in Peregrine are comparing it to the libraries listed below
Sorting:
- Snowflake dataset containing statistics for 70 million queries over 14 day period☆113Updated 3 years ago
- The DSB benchmark is designed for evaluating both workloaddriven and traditional database systems on modern decision support workloads. D…☆58Updated 7 months ago
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆77Updated this week
- A modular acceleration toolkit for big data analytic engines☆68Updated last year
- BI benchmark with user generated data and queries☆66Updated 6 months ago
- SQL-ProcBench is an open benchmark for procedural workloads in RDBMSs.☆47Updated 3 years ago
- Self regulation and auto-tuning for distributed system☆65Updated 2 years ago
- tpch-dbgen☆38Updated 13 years ago
- Java bindings for https://github.com/facebookincubator/velox☆28Updated this week
- Mirror of Apache crail (Incubating)☆150Updated 3 years ago
- Lakehouse storage system benchmark☆75Updated 2 years ago
- ☆38Updated last year
- MLOS is a project to enable autotuning for systems.☆162Updated this week
- ☆35Updated last year
- A stateful serverless platform☆242Updated 2 years ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆425Updated 3 years ago
- Rheem - a cross-platform data processing system☆5Updated 3 years ago
- Star Schema Benchmark data set generator (dbgen) - unified repository☆36Updated 2 months ago
- ☆85Updated this week
- Interactive-Speed Analytics: 200x Faster, 200x Fewer Cluster Resources, Approximate Query Processing☆250Updated 4 years ago
- Reproducing TPC-DS qualification/reference results☆32Updated last year
- Albis: High-Performance File Format for Big Data Systems☆21Updated 6 years ago
- TPC-H queries in Apache Spark SQL using native DataFrames API☆99Updated last year
- ☆24Updated last month
- TPC-DS queries☆61Updated 10 years ago
- Elastic ephemeral storage☆119Updated 3 years ago
- Generic driver for LDBC Graphalytics implementation☆83Updated 6 months ago
- [Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O☆73Updated 7 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆127Updated 6 months ago
- Spark Terasort☆121Updated 2 years ago