astra-sim / libraLinks
LIBRA: Enabling Workload-aware Multi-dimensional Network Topology Optimization for Distributed Training of Large AI Models
☆10Updated last year
Alternatives and similar repositories for libra
Users that are interested in libra are comparing it to the libraries listed below
Sorting:
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning☆21Updated last month
- ☆13Updated last year
- A Cycle-level simulator for M2NDP☆27Updated last month
- The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering☆52Updated 9 months ago
- ☆36Updated last year
- ☆23Updated 2 years ago
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)☆51Updated last year
- ☆74Updated 4 years ago
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches☆73Updated last year
- ☆24Updated 4 years ago
- Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025☆63Updated last month
- Sharing the codebase and steps for artifact evaluation/reproduction for MICRO 2024 paper☆9Updated 9 months ago
- Repository for MLCommons Chakra schema and tools☆39Updated last year
- ☆26Updated 4 years ago
- ☆148Updated 11 months ago
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆117Updated this week
- LLM Inference analyzer for different hardware platforms☆69Updated last week
- ☆36Updated last month
- RPCNIC: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator [HPCA2025]☆10Updated 5 months ago
- LLM serving cluster simulator☆102Updated last year
- A fast, accurate, and easy-to-integrate memory simulator that model memory system performance with bandwidth--latency curves.☆24Updated 3 weeks ago
- ☆19Updated 6 months ago
- This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…☆32Updated last year
- HyFiSS: A Hybrid Fidelity Stall-Aware Simulator for GPGPUs☆34Updated 5 months ago
- ☆14Updated 2 months ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling☆12Updated last year
- NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing☆83Updated 11 months ago
- Repository for MLCommons Chakra schema and tools☆101Updated 2 months ago
- MultiPIM: A Detailed and Configurable Multi-Stack Processing-In-Memory Simulator☆55Updated 3 years ago
- UPMEM LLM Framework allows profiling PyTorch layers and functions and simulate those layers/functions with a given hardware profile.☆29Updated this week