Cloud Native Benchmarking of Foundation Models
☆45Jul 31, 2025Updated 8 months ago
Alternatives and similar repositories for fmperf
Users that are interested in fmperf are comparing it to the libraries listed below.
- Predict the performance of LLM inference services☆23Sep 18, 2025Updated 6 months ago
- Community maintained hardware plugin for vLLM on Spyre☆50Updated this week
- ☆20Updated this week
- A tool to detect infrastructure issues on cloud native AI systems☆53Sep 18, 2025Updated 6 months ago
- WeChat official account crawler☆12Apr 13, 2024Updated 2 years ago
- Variant optimization autoscaler for distributed inference workloads☆37Updated this week
- Systematic and comprehensive benchmarks for LLM systems.☆54Jan 28, 2026Updated 2 months ago
- Prometheus collector and exporter for Slurm cluster metrics. A Slinky project.☆16Nov 7, 2025Updated 5 months ago
- ☆10Dec 10, 2024Updated last year
- Lithops-based Serverless implementation of the METASPACE spatial metabolomics annotation pipeline☆12Jul 6, 2023Updated 2 years ago
- ☆16May 27, 2025Updated 10 months ago
- Dynamic configuration management for Kubernetes☆26Updated this week
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆134Feb 22, 2024Updated 2 years ago
- Platform for analyzing and recommending Python packages and Python software stacks not only for AI/ML applications☆15Jun 29, 2020Updated 5 years ago
- An open source benchmarking framework for IT automation☆314Updated this week
- PyTorch implementation for the pilot study on the robustness of latent diffusion models.☆12Jun 20, 2023Updated 2 years ago
- OpenAPI Golang client library for Slurm REST API. A Slinky project.☆28Apr 1, 2026Updated last week
- scalable data movement in Exascale Supercomputers☆19Mar 30, 2026Updated 2 weeks ago
- The living Trust and Safety User Guide for the AI Alliance (https://thealliance.ai)☆15Apr 6, 2026Updated last week
- ☆29Updated this week
- Helm charts for llm-d☆52Jul 22, 2025Updated 8 months ago
- Collect information about 2018 CS courses in CSE of SYSU.☆11Jun 29, 2022Updated 3 years ago
- Unofficial OpenShift 4 Toolbox☆10Jan 26, 2024Updated 2 years ago
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- Official TensorFlow implementation for "Improving the Transferability of Adversarial Samples by Path-Augmented Method" (CVPR 2023).☆12Jun 16, 2023Updated 2 years ago
- A set of manifests for building a prototype TraceZip system, which is made up of 4 pieces.☆14Jul 17, 2025Updated 8 months ago
- GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM☆181Jul 12, 2024Updated last year
- Code repository for SRE agent as part of ITBench☆19Sep 9, 2025Updated 7 months ago
- Simulator for the datacenter, including power, cooling, server and other components☆17Feb 12, 2025Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆27Updated this week
- ☆17May 29, 2025Updated 10 months ago
- Health checks for Azure N- and H-series VMs.☆57Feb 5, 2026Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆85Updated this week
- ☆18Apr 3, 2026Updated last week
- MPI Benchmark on AWS HPC cluster☆20Jan 31, 2020Updated 6 years ago
- Microsoft question-answering dataset☆10Jun 16, 2023Updated 2 years ago
- Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving☆37Dec 27, 2019Updated 6 years ago
- ☆30Mar 24, 2026Updated 3 weeks ago
- [ICSE 2023] Differentiable interpretation and failure-inducing input generation for neural network numerical bugs.☆13Jan 5, 2024Updated 2 years ago