aliyun / SimAI
☆474 Updated last week
Alternatives and similar repositories for SimAI:
Users who are interested in SimAI are comparing it to the libraries listed below:
- ☆195 Updated 3 months ago
- ☆41 Updated 5 months ago
- ☆57 Updated 2 months ago
- ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale ☆342 Updated 2 weeks ago
- Curated collection of papers in machine learning systems ☆292 Updated 2 weeks ago
- FlagPerf is an open-source software platform for benchmarking AI chips. ☆328 Updated 2 months ago
- Repository for MLCommons Chakra schema and tools ☆95 Updated last month
- ☆14 Updated 2 months ago
- This repository is established to store personal notes and annotated papers during daily research. ☆117 Updated this week
- DeepSeek-V3/R1 inference performance simulator ☆111 Updated 3 weeks ago
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches ☆72 Updated last year
- Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and o… ☆85 Updated last week
- GLake: optimizing GPU memory management and IO transmission. ☆456 Updated 3 weeks ago
- LLM serving cluster simulator ☆96 Updated 11 months ago
- An interference-aware scheduler for fine-grained GPU sharing ☆132 Updated 2 months ago
- ☆285 Updated last year
- ☆51 Updated 9 months ago
- ☆133 Updated last year
- A large-scale simulation framework for LLM inference ☆363 Updated 5 months ago
- Artifacts for our NSDI'23 paper TGS ☆75 Updated 10 months ago
- Disaggregated serving system for Large Language Models (LLMs). ☆559 Updated last week
- paper and its code for AI System ☆293 Updated this week
- A highly optimized LLM inference acceleration engine for Llama and its variants. ☆885 Updated this week
- An acceleration library that supports arbitrary bit-width combinatorial quantization operations ☆221 Updated 6 months ago
- A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems ☆160 Updated 6 months ago
- NS3 simulator for RDMA over Converged Ethernet v2 (RoCEv2), including the implementation of DCQCN, TIMELY, PFC, ECN and shared buffer swi… ☆294 Updated 6 years ago
- NS3 simulator for RDMA load balancing ☆57 Updated 5 months ago
- ☆176 Updated 2 years ago
- ☆23 Updated 9 months ago
- ☆55 Updated last year