Cloud Native Benchmarking of Foundation Models
☆45Jul 31, 2025Updated 7 months ago
Alternatives and similar repositories for fmperf
Users that are interested in fmperf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Predict the performance of LLM inference services☆23Sep 18, 2025Updated 6 months ago
- Community maintained hardware plugin for vLLM on Spyre☆47Updated this week
- Test Orchestrator for Performance and Scalability of AI pLatforms☆16Updated this week
- ☆20Updated this week
- A tool to detect infrastructure issues on cloud native AI systems☆52Sep 18, 2025Updated 6 months ago
- WeChat official account crawler 微信公众号爬虫☆12Apr 13, 2024Updated last year
- DRANET is a Kubernetes Network Driver that uses Dynamic Resource Allocation (DRA) to deliver high-performance networking for demanding ap…☆71Updated this week
- A reading group for system verification papers☆10Sep 28, 2023Updated 2 years ago
- Variant optimization autoscaler for distributed inference workloads☆34Updated this week
- Systematic and comprehensive benchmarks for LLM systems.☆51Jan 28, 2026Updated last month
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable☆210Sep 21, 2024Updated last year
- vLLM performance dashboard☆43Apr 26, 2024Updated last year
- Prometheus collector and exporter for Slurm cluster metrics. A Slinky project.☆16Nov 7, 2025Updated 4 months ago
- ☆19Dec 4, 2025Updated 3 months ago
- Thoughts on programming languages, compilers, optimization, and performance.☆10Jul 15, 2019Updated 6 years ago
- Lithops-based Serverless implementation of the METASPACE spatial metabolomics annotation pipeline☆12Jul 6, 2023Updated 2 years ago
- Python Script to Open SJTU Dormitory Smart Lock☆10Sep 12, 2022Updated 3 years ago
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆134Feb 22, 2024Updated 2 years ago
- ☆44Updated this week
- An open source benchmarking framework for IT automation☆310Updated this week
- OpenAPI Golang client library for Slurm REST API. A Slinky project.☆26Updated this week
- Pytorch implementation for the pilot study on the robustness of latent diffusion models.☆12Jun 20, 2023Updated 2 years ago
- CAShift: Benchmarking Log-Based Cloud Attack Detection under Normality Shift (FSE 2025)☆13May 19, 2025Updated 10 months ago
- GenAI inference performance benchmarking tool☆156Mar 16, 2026Updated last week
- Helm charts for llm-d☆52Jul 22, 2025Updated 8 months ago
- Collect information about 2018 CS courses in CSE of SYSU.☆11Jun 29, 2022Updated 3 years ago
- ☆16Mar 5, 2026Updated 2 weeks ago
- Gateway API Inference Extension☆616Updated this week
- This repository manifests set which is made to build a prototype system of TraceZip, made by 4 pieces.☆14Jul 17, 2025Updated 8 months ago
- GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM☆180Jul 12, 2024Updated last year
- Code repository for SRE agent as part of ITBench☆19Sep 9, 2025Updated 6 months ago
- Simulator for the datacenter, including power, cooling, server and other components☆17Feb 12, 2025Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆27Updated this week
- Kernel Playground - A playground to run large scale experiments on the Linux Kernel☆17Nov 8, 2025Updated 4 months ago
- ☆10Jun 4, 2024Updated last year
- Health checks for Azure N- and H-series VMs.☆57Feb 5, 2026Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆85Updated this week
- MPI Benchmark on AWS HPC cluster☆20Jan 31, 2020Updated 6 years ago
- Microsoft question-answering dataset☆10Jun 16, 2023Updated 2 years ago