astra-sim / libra
LIBRA: Enabling Workload-aware Multi-dimensional Network Topology Optimization for Distributed Training of Large AI Models
☆11 Updated last year
Alternatives and similar repositories for libra
Users who are interested in libra are comparing it to the repositories listed below
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning ☆25 Updated last month
- ☆14 Updated last year
- A Cycle-level simulator for M2NDP ☆28 Updated 2 months ago
- ☆24 Updated 7 months ago
- Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025 ☆76 Updated 2 months ago
- Sharing the codebase and steps for artifact evaluation/reproduction for MICRO 2024 paper ☆9 Updated 10 months ago
- ☆168 Updated last year
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale ☆123 Updated last month
- NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing ☆87 Updated last year
- ☆144 Updated 5 months ago
- ☆75 Updated 4 years ago
- UPMEM LLM Framework allows profiling PyTorch layers and functions and simulating those layers/functions with a given hardware profile. ☆31 Updated last week
- ☆24 Updated 2 years ago
- ☆37 Updated last year
- ☆143 Updated last year
- ☆45 Updated 3 weeks ago
- Repository for MLCommons Chakra schema and tools ☆113 Updated last month
- ☆77 Updated last year
- ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale ☆393 Updated last month
- MultiPIM: A Detailed and Configurable Multi-Stack Processing-In-Memory Simulator ☆55 Updated 4 years ago
- GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated… ☆54 Updated this week
- The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering ☆53 Updated 11 months ago
- [HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System ☆46 Updated last year
- PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization ☆31 Updated last year
- Repository for MLCommons Chakra schema and tools ☆39 Updated last year
- ☆33 Updated last year
- A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarkin… ☆42 Updated last year
- LLM Inference analyzer for different hardware platforms ☆79 Updated last week
- Performance Prediction Toolkit for GPUs ☆37 Updated 3 years ago
- ☆27 Updated 4 years ago