Cloud Native Benchmarking of Foundation Models
☆45Jul 31, 2025Updated 9 months ago
Alternatives and similar repositories for fmperf
Users that are interested in fmperf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Predict the performance of LLM inference services☆23Sep 18, 2025Updated 8 months ago
- Community maintained hardware plugin for vLLM on Spyre☆52Updated this week
- Test Orchestrator for Performance and Scalability of AI pLatforms☆18May 11, 2026Updated 2 weeks ago
- ☆20Updated this week
- A tool to detect infrastructure issues on cloud native AI systems☆53Sep 18, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- WeChat official account crawler 微信公众号爬虫☆13Apr 13, 2024Updated 2 years ago
- A reading group for system verification papers☆10Sep 28, 2023Updated 2 years ago
- DRANET is a Kubernetes Network Driver that uses Dynamic Resource Allocation (DRA) to deliver high-performance networking for demanding ap…☆114Updated this week
- Variant optimization autoscaler for distributed inference workloads☆40Updated this week
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable☆218Sep 21, 2024Updated last year
- Systematic and comprehensive benchmarks for LLM systems.☆58Jan 28, 2026Updated 3 months ago
- An experimental framework for temporal verification based on first-order linear-time temporal logic. Our goal is to express transition sy…☆22Mar 29, 2026Updated last month
- Prometheus collector and exporter for Slurm cluster metrics. A Slinky project.☆18Nov 7, 2025Updated 6 months ago
- ☆18Dec 4, 2025Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆10Dec 10, 2024Updated last year
- ☆16May 27, 2025Updated 11 months ago
- Python Script to Open SJTU Dormitory Smart Lock☆10Sep 12, 2022Updated 3 years ago
- Dynamic configuration management for Kubernetes☆27May 18, 2026Updated last week
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆134Feb 22, 2024Updated 2 years ago
- ☆12Apr 4, 2022Updated 4 years ago
- OpenAPI Golang client library for Slurm REST API. A Slinky project.☆29May 8, 2026Updated 2 weeks ago
- ☆50Updated this week
- scalable data movement in Exascale Supercomputers☆19Mar 30, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- CAShift: Benchmarking Log-Based Cloud Attack Detection under Normality Shift (FSE 2025)☆13May 19, 2025Updated last year
- ☆29Updated this week
- The living Trust and Safety User Guide for the AI Alliance (https://thealliance.ai)☆16Updated this week
- Helm charts for llm-d☆52Jul 22, 2025Updated 10 months ago
- Collect information about 2018 CS courses in CSE of SYSU.☆11Jun 29, 2022Updated 3 years ago
- Region-level profiling for CUDA kernels with trace, NVBit, CUPTI, NSys, and an interactive Explorer.☆117Apr 17, 2026Updated last month
- Gateway API Inference Extension☆675May 19, 2026Updated last week
- Unofficial OpenShift 4 Toolbox☆10Jan 26, 2024Updated 2 years ago
- Official Tensorflow implementation for "Improving the Transferability of Adversarial Samples by Path-Augmented Method" (CVPR 2023).☆12Jun 16, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM☆183Jul 12, 2024Updated last year
- Code repository for SRE agent as part of ITBench☆19Sep 9, 2025Updated 8 months ago
- Simulator for the datacenter, including power, cooling, server and other components☆18Feb 12, 2025Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆29Updated this week
- ☆17May 29, 2025Updated 11 months ago
- Health checks for Azure N- and H-series VMs.☆57May 13, 2026Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆88Updated this week