microsoft / PeregrineLinks
Peregrine is a workload optimization platform for cloud query engines. The goal of Peregrine is three-fold: 1. make it easier to ingest and analyze query workload telemetry into a common engine-agnostic representation, 2. help developers to quickly build workload optimization applications to reduce overall costs and improve operational efficien…
☆22Updated 5 years ago
Alternatives and similar repositories for Peregrine
Users that are interested in Peregrine are comparing it to the libraries listed below
Sorting:
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆427Updated 3 years ago
- SQL-ProcBench is an open benchmark for procedural workloads in RDBMSs.☆48Updated 3 years ago
- TPCDS benchmark for various engines☆18Updated 3 years ago
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆82Updated this week
- MLOS is a project to enable autotuning for systems.☆165Updated 2 months ago
- The DSB benchmark is designed for evaluating both workloaddriven and traditional database systems on modern decision support workloads. D…☆63Updated 10 months ago
- TPC-H dbgen☆313Updated 2 years ago
- tpch-dbgen☆38Updated 13 years ago
- A modular acceleration toolkit for big data analytic engines☆67Updated last year
- AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure☆151Updated 4 years ago
- FishStore is a prototype fast ingestion and querying layer for flexible-schema data☆228Updated last year
- TPC-H queries in Apache Spark SQL using native DataFrames API☆98Updated last year
- Mirror of Apache crail (Incubating)☆150Updated 3 years ago
- TPC-DS queries☆62Updated 10 years ago
- Generate big TPC-DS datasets with Databricks☆20Updated 3 years ago
- Lakehouse storage system benchmark☆76Updated 2 years ago
- Telemetry and logs generator for benchmarks☆21Updated 3 years ago
- Cache File System optimized for columnar formats and object stores☆185Updated 3 years ago
- Albis: High-Performance File Format for Big Data Systems☆21Updated 7 years ago
- BI benchmark with user generated data and queries☆71Updated 8 months ago
- Parquet file generator☆22Updated 7 years ago
- Self regulation and auto-tuning for distributed system☆66Updated 2 years ago
- Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs☆238Updated 7 months ago
- Snowflake dataset containing statistics for 70 million queries over 14 day period☆114Updated 3 years ago
- Tools for running benchmarks against Citus☆40Updated 2 months ago
- Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretat…☆260Updated 7 years ago
- Pytest plugin for writing Azure Data Factory Integration Tests☆25Updated 3 years ago
- The Naiad system provides fast incremental and iterative computation for data-parallel workloads☆25Updated 7 years ago
- Apache Quickstep Incubator - This project is retired☆95Updated 6 years ago
- ☆38Updated 4 years ago