MLSysOps/InfraGym

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MLSysOps/InfraGym)

MLSysOps / InfraGym

Empowering LLM Agents for Real-World Computer System Optimization

☆17

Alternatives and similar repositories for InfraGym

Users that are interested in InfraGym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MLSysOps / Code-Agent-Survey
View on GitHub
A survey of Code Agents / Foundation Models for improving development productivity. Become 10x SWE, MLE, etc.
☆22Aug 20, 2024Updated last year
LMCache / LMIgnite
View on GitHub
☆28Jul 29, 2025Updated 11 months ago
swagshaw / Rainbow-Keywords
View on GitHub
Rainbow Keywords - Official PyTorch Implementation
☆14Jun 27, 2024Updated 2 years ago
carsonpo / safetensors.cpp
View on GitHub
Zero Dependency LibTorch Safetensors Loading and Storing in C++
☆23Jul 12, 2024Updated 2 years ago
mailliw2010 / infer-frame
View on GitHub
a ai infra framework for edge device base on nndeploy
☆18Nov 27, 2025Updated 7 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
NVIDIA / srt-slurm
View on GitHub
NVIDIA Inference Benchmarks provide recipes in ready-to-use templates for evaluating platform speed. Validate your platform across speci…
☆40Updated this week
ByteDance-Seed / StragglerAnalysis
View on GitHub
☆56Apr 30, 2025Updated last year
ChunelFeng / CGraph-lite
View on GitHub
A one-page-only CGraph-API-liked DAG project.
☆28Feb 11, 2025Updated last year
Xseventh / dagflow
View on GitHub
C++数据流并行处理框架
☆24Apr 10, 2021Updated 5 years ago
Leo9660 / HedraRAG_AE
View on GitHub
Artifact Evaluation for SOSP 2025
☆21Aug 16, 2025Updated 11 months ago
romitjain / kachua-mlsys
View on GitHub
[MLSys 26] 🥇 Solution for Gated Delta Net Track of MLSys 26 Flash infer competition
☆35May 22, 2026Updated last month
Cambricon / easydk
View on GitHub
easy development kit
☆12Apr 18, 2025Updated last year
CareF / CareF-knowledge-lib
View on GitHub
Personal knowledge library
☆10Nov 9, 2017Updated 8 years ago
sykwer / callback_isolated_executor
View on GitHub
The ComponentContainer and Executor that assign a dedicated thread for each callback group.
☆10Jun 20, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
herryxu / poetry_cut
View on GitHub
古诗词分词，词向量分析，输出到excel，云图
☆10Jul 6, 2022Updated 4 years ago
eth-easl / pccheck
View on GitHub
☆12Apr 23, 2026Updated 2 months ago
alibaba / hap
View on GitHub
☆16Apr 13, 2024Updated 2 years ago
AlphaZTX / phyasgn
View on GitHub
北京大学物理学院课程作业模板
☆11Sep 30, 2022Updated 3 years ago
Montimage / maip
View on GitHub
A platform that provides users with easy access to AI services developed by Montimage and usage of explainable AI techniques (e.g., LIME,…
☆10Feb 17, 2026Updated 5 months ago
noagarcia / dresstar
View on GitHub
Video retrieval from query images
☆11Oct 10, 2017Updated 8 years ago
MSNLAB / Federated-Lifelong-Person-ReID
View on GitHub
FedSTIL: Spatial-Temporal Federated Learning for Lifelong Person Re-identification on Distributed Edges. (TCSVT'23)
☆38Dec 12, 2024Updated last year
netx-repo / PipeSwitch
View on GitHub
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆127May 9, 2022Updated 4 years ago
yzhao062 / mmad
View on GitHub
multimodal anomaly detection
☆14Jan 17, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
microsoft / Soroush
View on GitHub
Microsoft's open source max-min fair solver for cluster scheduling and traffic engineering
☆19Apr 13, 2026Updated 3 months ago
AI-Hypercomputer / gpu-recipes
View on GitHub
Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.
☆138Jul 1, 2026Updated 2 weeks ago
xinzhel / LLM-Search
View on GitHub
Survey on LLM Inference via Search (TMLR 2025)
☆15May 6, 2025Updated last year
njuxx / nju-Latex-beamer-template
View on GitHub
☆10May 26, 2020Updated 6 years ago
BreakingY / jetpack-dec-enc
View on GitHub
Jetson Video Encoding and Decoding ; Jetson Jetpack5.x视频编解码库
☆48Jan 5, 2026Updated 6 months ago
CTeX-org / learnlatex.github.io
View on GitHub
Learn LaTeX online
☆15Apr 1, 2022Updated 4 years ago
TritonNetworking / opera-sim
View on GitHub
Packet-level simulation code to model Opera and other networks from the 2020 NSDI paper "Expanding across time to deliver bandwidth effic…
☆15Jun 10, 2020Updated 6 years ago
ChenyangZhang-cs / iMLBench
View on GitHub
iMLBench is a machine learning benchmark suite targeting CPU-GPU integrated architectures.
☆11May 29, 2021Updated 5 years ago
Vilin97 / Clawristotle
View on GitHub
OpenClaw-style theorem proving
☆26Jun 11, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ZhenshengLee / nv_driveos
View on GitHub
learning materials of driveos from nvidia drive sdk.
☆13Jun 10, 2026Updated last month
aouedions11 / Network_Traffc_prediction
View on GitHub
☆16Feb 10, 2023Updated 3 years ago
awslabs / optimizing-multitask-training-through-dynamic-pipelines
View on GitHub
Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
☆19Dec 8, 2023Updated 2 years ago
nndeploy / nndeploy-workflow
View on GitHub
workflow of nndeploy
☆14Nov 5, 2025Updated 8 months ago
gogongxt / nano-sglang
View on GitHub
☆161Mar 5, 2026Updated 4 months ago
hao-ai-lab / LookaheadReasoning
View on GitHub
[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning
☆69Oct 31, 2025Updated 8 months ago
bytedance / QSync
View on GitHub
Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".
☆20Feb 23, 2024Updated 2 years ago