microsoft/lst-bench

LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as Delta Lake, Apache Hudi, and Apache Iceberg.

☆90

Alternatives and similar repositories for lst-bench

Users who are interested in lst-bench are comparing it to the repositories listed below.

xskipper-io / xskipper
An Extensible Data Skipping Framework
☆50 · Jul 15, 2025 · Updated last year
lhbench / lhbench
Lakehouse storage system benchmark
☆82 · Feb 22, 2023 · Updated 3 years ago
alexandervanrenen / cab
A benchmark for serverless analytic databases.
☆26 · Jan 23, 2026 · Updated 6 months ago
zilliztech / kafka-connect-milvus
kafka-connect-milvus sink connector
☆24 · Jan 23, 2026 · Updated 6 months ago
apache / iceberg-docs
Apache Iceberg Documentation Site
☆42 · Feb 5, 2024 · Updated 2 years ago
apache / incubator-xtable
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processin…
☆1,196 · Updated this week
ldbc / dbgen.JCC-H
☆22 · Apr 17, 2024 · Updated 2 years ago
manuzhang / awesome-lakehouse
a curated list of awesome lakehouse frameworks, applications, etc
☆49 · Mar 9, 2026 · Updated 4 months ago
chenhao-ye / polaris
Source code for the SIGMOD '23 paper “Polaris: Enabling Transaction Priority in Optimistic Concurrency Control”
☆27 · Jul 10, 2023 · Updated 3 years ago
mjasny / vldb26-iouring
☆18 · Mar 31, 2026 · Updated 3 months ago
Qbeast-io / qbeast-spark
Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!
☆233 · Jan 24, 2025 · Updated last year
decis-bench / febench
A Benchmark for Real-Time Relational Data Feature Extraction (VLDB'23 Best Industry Paper Runnerup)
☆54 · Sep 9, 2023 · Updated 2 years ago
tustvold / access-log-bench
☆14 · Dec 8, 2022 · Updated 3 years ago
cwida / tpcds-result-reproduction
Reproducing TPC-DS qualification/reference results
☆36 · Aug 16, 2023 · Updated 2 years ago
utndatasystems / redbench
Redbench is a set of 30 analytical SQL workloads that can be used to benchmark workload-driven optimizations (aiDM @SIGMOD'25).
☆21 · May 4, 2025 · Updated last year
SNU-ARC / WALTZ
☆13 · May 9, 2023 · Updated 3 years ago
aws-samples / emr-remote-shuffle-service
☆18 · May 7, 2026 · Updated 2 months ago
ucare-uchicago / tinyTailFlash
ttFlash is a "tiny-tail" flash drive (SSD) that eliminates GC-induced tail latencies by circumventing GC-blocked I/Os with RAIN.
☆12 · Mar 17, 2017 · Updated 9 years ago
dessertlab / Fault-Injection-Dataset
Failure dataset accompanying the paper "How Bad Can a Bug Get? An Empirical Analysis of Software Failures in the OpenStack Cloud Computi…
☆10 · Jun 12, 2020 · Updated 6 years ago
BU-DiSC / K-V-Workload-Generator
☆10 · Aug 25, 2025 · Updated 10 months ago
projectnessie / nessie
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
☆1,483 · Updated this week
fybrik / fybrik
Fybrik
☆130 · Sep 7, 2025 · Updated 10 months ago
DSM-fudan / Dumpy
Dumpy: A Compact and Adaptive Index for Large Data Series Collections (SIGMOD'23)
☆13 · Dec 12, 2023 · Updated 2 years ago
CrowdStrike / kafka-replicator
Kafka replicator is a tool used to mirror and backup Kafka topics across regions
☆18 · Feb 14, 2023 · Updated 3 years ago
adobe / lake-pulse
A Rust library for analyzing data lake table health — checking the pulse — across multiple formats (Delta Lake, Apache Iceberg, Apache Hu…
☆20 · Jul 11, 2026 · Updated last week
hpides / autovec-db
Code for our paper "Evaluating SIMD Compiler-Intrinsics for Database Systems"
☆16 · Jul 5, 2023 · Updated 3 years ago
Stream-SQL-TCK / Stream-SQL-TCK
☆13 · Mar 2, 2018 · Updated 8 years ago
databendlabs / openkv
LSM based key-value store in rust, design for cloud
☆87 · Feb 27, 2022 · Updated 4 years ago
kgyrtkirk / hive-dev-box
☆23 · Nov 5, 2024 · Updated last year
onehouseinc / LakeView
Monitoring and insights on your data lakehouse tables
☆32 · Updated this week
yxymit / s3filter
☆22 · Dec 18, 2023 · Updated 2 years ago
datayoga-io / datayoga
streaming data pipeline platform
☆30 · Jun 3, 2026 · Updated last month
linkedin / openhouse
Open Control Plane for Tables in Data Lakehouse
☆392 · Updated this week
rbalamohan / tez-autobuild
A Tez dev-setup for HDP2 sandbox
☆21 · Mar 2, 2023 · Updated 3 years ago
dstreev / hdfs-cli
Traverse HDFS without jvm startup delays and directory context!! Supports multiple HDFS hosts, command line history and tab completion.
☆17 · May 20, 2016 · Updated 10 years ago
itrummer / DataCorrelationPredictionWithNLP
This project aims at predicting correlated column pairs in data tables by analyzing column names via large language models.
☆11 · Aug 21, 2023 · Updated 2 years ago
AutomataLab / Pison
Scalable Structural Index Constructor for JSON Analytics
☆27 · Oct 10, 2024 · Updated last year
cruxprotocol / js-sdk
CruxPay Javascript SDK
☆11 · Jan 7, 2023 · Updated 3 years ago
tidb-incubator / vldbss-2021
☆21 · Jul 20, 2021 · Updated 5 years ago