Cloud Native Benchmarking of Foundation Models
☆45Jul 31, 2025Updated 10 months ago
Alternatives and similar repositories for fmperf
Users that are interested in fmperf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Predict the performance of LLM inference services☆23Sep 18, 2025Updated 8 months ago
- Community maintained hardware plugin for vLLM on Spyre☆52Updated this week
- Test Orchestrator for Performance and Scalability of AI pLatforms☆18May 26, 2026Updated 2 weeks ago
- ☆20Updated this week
- Knative benchmark suite for Quarkus☆11Feb 5, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A tool to detect infrastructure issues on cloud native AI systems☆53Sep 18, 2025Updated 8 months ago
- WeChat official account crawler 微信公众号爬虫☆13Apr 13, 2024Updated 2 years ago
- A reading group for system verification papers☆10Sep 28, 2023Updated 2 years ago
- ☆17May 8, 2020Updated 6 years ago
- Variant optimization autoscaler for distributed inference workloads☆44Updated this week
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable☆221Sep 21, 2024Updated last year
- Experimental Linear Algebra Performance Studies☆12Feb 24, 2017Updated 9 years ago
- Systematic and comprehensive benchmarks for LLM systems.☆59Jan 28, 2026Updated 4 months ago
- An experimental framework for temporal verification based on first-order linear-time temporal logic. Our goal is to express transition sy…☆23Mar 29, 2026Updated 2 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Prometheus collector and exporter for Slurm cluster metrics. A Slinky project.☆18Nov 7, 2025Updated 7 months ago
- ☆19Dec 4, 2025Updated 6 months ago
- ☆10Dec 10, 2024Updated last year
- Lithops-based Serverless implementation of the METASPACE spatial metabolomics annotation pipeline☆12Jul 6, 2023Updated 2 years ago
- Dynamic configuration management for Kubernetes☆27Updated this week
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆134Feb 22, 2024Updated 2 years ago
- Pytorch implementation for the pilot study on the robustness of latent diffusion models.☆12Jun 20, 2023Updated 2 years ago
- OpenAPI Golang client library for Slurm REST API. A Slinky project.☆32Updated this week
- An open source benchmarking framework for IT automation☆421Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- scalable data movement in Exascale Supercomputers☆19Mar 30, 2026Updated 2 months ago
- CAShift: Benchmarking Log-Based Cloud Attack Detection under Normality Shift (FSE 2025)☆14May 19, 2025Updated last year
- Helm charts for llm-d☆52Jul 22, 2025Updated 10 months ago
- GenAI inference performance benchmarking tool☆195Updated this week
- Collect information about 2018 CS courses in CSE of SYSU.☆11Jun 29, 2022Updated 3 years ago
- ☆17Mar 5, 2026Updated 3 months ago
- Region-level profiling for CUDA kernels with trace, NVBit, CUPTI, NSys, and an interactive Explorer.☆118Apr 17, 2026Updated last month
- Gateway API Inference Extension☆688Updated this week
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repository manifests set which is made to build a prototype system of TraceZip, made by 4 pieces.☆14Jul 17, 2025Updated 10 months ago
- GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM☆182Jul 12, 2024Updated last year
- ⚠️ ARCHIVED - All development moved to https://github.com/itbench-hub/ITBench-CISO-SRE-FinOps-Agent☆21Sep 9, 2025Updated 9 months ago
- Simulator for the datacenter, including power, cooling, server and other components☆18Feb 12, 2025Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆30Updated this week
- ☆10Jun 4, 2024Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆88Jun 5, 2026Updated last week