AIS-SNU / Smart-InfinityLinks

[HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System

☆49

Alternatives and similar repositories for Smart-Infinity

Users that are interested in Smart-Infinity are comparing it to the libraries listed below

Sorting:

leesou / PIM-DL-ASPLOS
PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization
☆33Updated last year
AIS-SNU / PID-Comm
☆27Updated 10 months ago
PSAL-POSTECH / M2NDP-public
A Cycle-level simulator for M2NDP
☆31Updated 2 months ago
upmem / upmem_llm_framework
UPMEM LLM Framework allows profiling PyTorch layers and functions and simulate those layers/functions with a given hardware profile.
☆36Updated 2 months ago
ranggihwang / Pregated_MoE
☆55Updated last year
Yufeng98 / CENT
Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025
☆100Updated 5 months ago
PKUZHOU / NeoMem-MICRO-2024
The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering
☆57Updated last year
AIS-SNU / GraNNDis_Artifact
[PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…
☆11Updated last year
yonsei-hpcp / pid-join
☆11Updated 5 months ago
casys-kaist / LLMServingSim
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale
☆143Updated 3 months ago
platformxlab / G10
☆40Updated 2 years ago
miglopst / PIM_NDP_papers
☆65Updated 4 years ago
casys-kaist / NeuPIMs
NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing
☆95Updated last year
abhibambhaniya / GenZ-LLM-Analyzer
LLM Inference analyzer for different hardware platforms
☆94Updated 3 months ago
VIA-Research / uPIMulator
☆153Updated 8 months ago
pku-liang / ArkVale
ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24)
☆43Updated 10 months ago
ferry-hhh / CXL-DMSim
CXL-DMSim: A Full-System CXL Disaggregated Memory Simulator With Comprehensive Silicon Validation
☆97Updated last month
scale-snu / attacc_simulator
☆97Updated last year
casys-kaist / HUVM
☆24Updated 3 years ago
SNU-ARC / flashneuron
☆39Updated 2 years ago
pku-liang / MAGIS
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)
☆55Updated last year
PrincetonUniversity / LLMCompass
☆194Updated last year
zyqCSL / DiffKV
☆22Updated 2 weeks ago
MeshInfra / WaferLLM
WaferLLM: Large Language Model Inference at Wafer Scale
☆61Updated last week
fpgasystems / Chameleon-RAG-Acceleration
☆19Updated 4 months ago
thu-nics / UniNDP
Github repository of HPCA 2025 paper "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures"
☆15Updated last month
VIA-Research / vTrain
☆73Updated 4 months ago
RC4ML / RPCNIC
RPCNIC: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator [HPCA2025]
☆11Updated 10 months ago
WangYaohuii / CXL-SSD-Sim
A Full-System Simulator for CXL-Based SSD Memory System
☆32Updated 10 months ago
sitar-lab / NeuSight
☆53Updated 4 months ago