Predict the performance of LLM inference services
☆21Sep 18, 2025Updated 5 months ago
Alternatives and similar repositories for LLM-performance-prediction
Users that are interested in LLM-performance-prediction are comparing it to the libraries listed below
Sorting:
- Simulator for the datacenter, including power, cooling, server and other components☆17Feb 12, 2025Updated last year
- LangBench applications and scripts☆14Jun 7, 2023Updated 2 years ago
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆135Feb 22, 2024Updated 2 years ago
- Serverless Paper Reading and Discussion☆38Jan 9, 2023Updated 3 years ago
- ☆19May 10, 2025Updated 9 months ago
- Releasing the spot availability traces used in "Can't Be Late" paper.☆24Mar 31, 2024Updated last year
- ☆20Sep 25, 2023Updated 2 years ago
- TraceWeaver is a research prototype for transparently tracing requests through a microservice without application instrumentation.☆23Sep 2, 2024Updated last year
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]☆24Nov 21, 2024Updated last year
- ☆175Mar 12, 2024Updated last year
- A large-scale simulation framework for LLM inference☆539Jul 25, 2025Updated 7 months ago
- ☆33Jan 14, 2025Updated last year
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆35Feb 11, 2026Updated 2 weeks ago
- 板球控制系統/滾球系統/BallPlate 2017年全国大学生电子设计竞赛B题 全国二等奖作品☆10May 27, 2024Updated last year
- ☁️ Benchmarking LLMs for Cloud Config Generation | 云场景下的大模型基准测试☆39Oct 25, 2024Updated last year
- LLTFI is a tool, which is an extension of LLFI, allowing users to run fault injection experiments on C/C++, TensorFlow and PyTorch applic…☆41Oct 4, 2024Updated last year
- LLMServingSim 2.0: A Unified Simulator for Heterogeneous and Disaggregated LLM Serving Infrastructure☆177Updated this week
- LLM Inference analyzer for different hardware platforms☆101Feb 17, 2026Updated last week
- A Framework for Automated Validation of Deep Learning Training Tasks☆62Feb 13, 2026Updated 2 weeks ago
- Code repository for scenarios and environment setup as part of ITBench☆15Feb 19, 2026Updated last week
- Microsoft question-answering dataset☆10Jun 16, 2023Updated 2 years ago
- ☆12Apr 14, 2025Updated 10 months ago
- Kernel Playground - A playground to run large scale experiments on the Linux Kernel☆17Nov 8, 2025Updated 3 months ago
- A CSS3 Overlay system for modal dialogs.☆66Dec 16, 2010Updated 15 years ago
- A tool to detect infrastructure issues on cloud native AI systems☆52Sep 18, 2025Updated 5 months ago
- ⚠️ [DEPRECATED] Ticketswoop is a basic puppeteer bot to purchase tickets on ticketswap☆13Oct 9, 2024Updated last year
- Code for SIGKDD2025 paper: An Efficient Diffusion-based Non-Autoregressive Solver for Traveling Salesman Problem☆14Jan 28, 2025Updated last year
- use iconfont in reactNative by react-native-vector-icons☆11Sep 13, 2017Updated 8 years ago
- ☆18Mar 23, 2025Updated 11 months ago
- Shared Cheat Sheet for Coq☆10Sep 8, 2016Updated 9 years ago
- Secure and Scalable Federated Learning using Serverless Computing☆12Jan 31, 2024Updated 2 years ago
- SplitBud is a Split Learning framework built upon Flower☆14Mar 22, 2025Updated 11 months ago
- ☆16Jan 14, 2025Updated last year
- Official implementation of Tabular Transfer Learning via Prompting LLMs (COLM 2024).☆13Aug 6, 2024Updated last year
- E-commerce search benchmark is the first end-to-end application benchmark for e-commerce search system with personalized recommendations.…☆45Feb 15, 2023Updated 3 years ago
- ☆14Jan 10, 2025Updated last year
- Yad2 smart scraper with a minimal setup☆17Jun 18, 2023Updated 2 years ago
- ☆11Oct 17, 2024Updated last year
- Tool to convert JSON formatted discussion posts on Canvas LMS into HTML files - similar to saving student text-entry assignments☆13May 20, 2022Updated 3 years ago