Cloud Native Benchmarking of Foundation Models
☆45Jul 31, 2025Updated 7 months ago
Alternatives and similar repositories for fmperf
Users that are interested in fmperf are comparing it to the libraries listed below
Sorting:
- Predict the performance of LLM inference services☆21Sep 18, 2025Updated 5 months ago
- Test Orchestrator for Performance and Scalability of AI pLatforms☆16Updated this week
- A tool to detect infrastructure issues on cloud native AI systems☆52Sep 18, 2025Updated 5 months ago
- Lithops-based Serverless implementation of the METASPACE spatial metabolomics annotation pipeline☆12Jul 6, 2023Updated 2 years ago
- Prometheus collector and exporter for Slurm cluster metrics. A Slinky project.☆16Nov 7, 2025Updated 3 months ago
- Community maintained hardware plugin for vLLM on Spyre☆46Updated this week
- Variant optimization autoscaler for distributed inference workloads☆31Updated this week
- scalable data movement in Exascale Supercomputers☆17Updated this week
- ☆20Updated this week
- OpenAPI Golang client library for Slurm REST API. A Slinky project.☆22Updated this week
- Health checks for Azure N- and H-series VMs.☆57Feb 5, 2026Updated 3 weeks ago
- DRANET is a Kubernetes Network Driver that uses Dynamic Resource Allocation (DRA) to deliver high-performance networking for demanding ap…☆58Updated this week
- MPI Benchmark on AWS HPC cluster☆20Jan 31, 2020Updated 6 years ago
- GenAI inference performance benchmarking tool☆151Updated this week
- Cloud Energy is an XGBoost & linear model based on the energy data from the SPECPower database for the cloud to estimate wattage consumpt…☆30Feb 18, 2026Updated last week
- Model Server for Kepler☆29Feb 2, 2026Updated last month
- llm-d benchmark scripts and tooling☆48Updated this week
- 凵 Full-system, queuing simulator for serverless workflows.☆25Aug 1, 2023Updated 2 years ago
- WG Serving☆34Dec 15, 2025Updated 2 months ago
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable☆210Sep 21, 2024Updated last year
- ☆43Updated this week
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions☆35Sep 12, 2025Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆85Updated this week
- Systematic and comprehensive benchmarks for LLM systems.☆51Jan 28, 2026Updated last month
- ☆11Sep 21, 2022Updated 3 years ago
- Highly ergonomic and portable helpers for terminal navigation.☆20Nov 3, 2025Updated 4 months ago
- WeChat official account crawler 微信公众号爬虫☆12Apr 13, 2024Updated last year
- Memory Topology for GPUs☆17Feb 13, 2026Updated 2 weeks ago
- vLLM performance dashboard☆42Apr 26, 2024Updated last year
- PARADIS, a lightweight and flexible weather forecast model that tries to Keep It Simple.☆26Feb 4, 2026Updated 3 weeks ago
- Main Kagenti repo - installer, UI and docs☆72Feb 25, 2026Updated last week
- ATLAHS: An Application-centric Network Simulator Toolchain for AI, HPC, and Distributed Storage☆71Feb 6, 2026Updated 3 weeks ago
- Gateway API Inference Extension☆597Updated this week
- Kubernetes-native AI serving platform for scalable model serving.☆233Updated this week
- ☆10Updated this week
- EPOCH Input System Version 2☆10Jun 5, 2020Updated 5 years ago
- Python wrapper for the energy system optimization framework IESopt.☆18Feb 23, 2026Updated last week
- Code for paper "Beyond Closure Models: Learning Chaotic Systems via Physics-Informed Neural Operators".☆14Dec 24, 2025Updated 2 months ago
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 6 months ago