microsoft / PeregrineLinks
Peregrine is a workload optimization platform for cloud query engines. The goal of Peregrine is three-fold: 1. make it easier to ingest and analyze query workload telemetry into a common engine-agnostic representation, 2. help developers to quickly build workload optimization applications to reduce overall costs and improve operational efficien…
☆22Updated 5 years ago
Alternatives and similar repositories for Peregrine
Users that are interested in Peregrine are comparing it to the libraries listed below
Sorting:
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆432Updated 4 years ago
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆88Updated 3 months ago
- SQL-ProcBench is an open benchmark for procedural workloads in RDBMSs.☆48Updated 4 years ago
- Mirror of Apache crail (Incubating)☆151Updated 3 years ago
- Performance Analysis Tool☆78Updated 2 months ago
- TPC-H queries in Apache Spark SQL using native DataFrames API☆98Updated 2 years ago
- Lakehouse storage system benchmark☆77Updated 2 years ago
- This repository contains the code base for the Open Stream Processing Benchmark.☆55Updated 4 years ago
- Self regulation and auto-tuning for distributed system☆67Updated 2 years ago
- TPC-DS queries☆64Updated 10 years ago
- A modular acceleration toolkit for big data analytic engines☆67Updated last year
- tpch-dbgen☆38Updated 13 years ago
- A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.☆31Updated this week
- Measuring the performance of popular streaming engines with Yahoo's Streaming Benchmark☆53Updated 6 years ago
- TPC-H dbgen☆324Updated 2 years ago
- The DSB benchmark is designed for evaluating both workloaddriven and traditional database systems on modern decision support workloads. D…☆72Updated last year
- ☆20Updated 5 years ago
- TPCDS benchmark for various engines☆18Updated 3 years ago
- Point-in-Time optimizations for Apache Spark☆30Updated 2 years ago
- Spark Terasort☆121Updated 2 years ago
- Hadoop utility jar for troubleshooting integration with cloud object stores☆37Updated 2 weeks ago
- Community Java bindings for https://github.com/facebookincubator/velox☆39Updated this week
- Tools for running benchmarks against Citus☆43Updated 7 months ago
- A stateful serverless platform☆245Updated 3 years ago
- Albis: High-Performance File Format for Big Data Systems☆21Updated 7 years ago
- Code for Ernest☆34Updated 2 years ago
- Cache File System optimized for columnar formats and object stores☆187Updated 3 years ago
- TPC-DS benchmark kit with some modifications/fixes☆356Updated last year
- This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid…☆258Updated 6 years ago
- Window-Based Hybrid CPU/GPU Stream Processing Engine☆42Updated 3 years ago