scale-snu/LLMSimulator

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/scale-snu/LLMSimulator)

scale-snu / LLMSimulator

☆56

Alternatives and similar repositories for LLMSimulator

Users that are interested in LLMSimulator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

scale-snu / DyLLM
View on GitHub
☆19Updated this week
scale-snu / mcsim_public
View on GitHub
☆22Feb 26, 2023Updated 3 years ago
scale-snu / attacc_simulator
View on GitHub
☆158Jun 24, 2024Updated 2 years ago
Yufeng98 / CENT
View on GitHub
Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025
☆142May 3, 2025Updated last year
scale-snu / Sudoku
View on GitHub
A tool for decomposing DRAM address mapping into component-level functions
☆16Jun 12, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
casys-kaist / pimba
View on GitHub
Official code repository for "Pimba: A Processing-in-Memory Acceleration for Post-Transformer Large Language Model Serving [MICRO'25]"
☆25Oct 23, 2025Updated 8 months ago
scale-snu / ckks-gpu-core
View on GitHub
☆116Jul 4, 2024Updated 2 years ago
scale-snu / layered-prefill
View on GitHub
Layered prefill changes the scheduling axis from tokens to layers and removes redundant MoE weight reloads while keeping decode stall fre…
☆18Mar 9, 2026Updated 4 months ago
arkhadem / aim_simulator
View on GitHub
A simulator for SK hynix AiM PIM architecture based on Ramulator 2.0
☆69Jul 22, 2025Updated last year
scale-snu / AE_DRAMScope_ISCA2024
View on GitHub
☆15Apr 18, 2024Updated 2 years ago
VIA-Research / AgentBench
View on GitHub
The set of AI agent model implementations, benchmarks, and others used in our paper "The Cost of Dynamic Reasoning: Demystifying AI Agent…
☆42Mar 26, 2026Updated 3 months ago
PSAL-POSTECH / accelsim_HMS
View on GitHub
☆12Jul 2, 2024Updated 2 years ago
PSAL-POSTECH / ONNXim
View on GitHub
ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference
☆209Jan 8, 2026Updated 6 months ago
efeslab / siloz
View on GitHub
☆11Aug 23, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
VIA-Research / SwarmIO
View on GitHub
SwarmIO is an SSD emulation framework for next-generation GPU-centric storage systems research
☆52May 24, 2026Updated last month
harvard-acc / DreamRAM
View on GitHub
DreamRAM: A Fine-Grained Configurable Design Space Modeling Tool for Custom 3D Die-Stacked DRAM
☆16Apr 28, 2026Updated 2 months ago
georgia-tech-synergy-lab / SparseAccelerator-RTL
View on GitHub
Accelerator RTL inspired by VEGETA [HPCA'23] and MicroScopiQ [ISCA'25]
☆15Nov 11, 2025Updated 8 months ago
casys-kaist / LLMServingSim
View on GitHub
LLMServingSim 2.0: A Unified Simulator for Heterogeneous and Disaggregated LLM Serving Infrastructure
☆344Jul 15, 2026Updated last week
CMU-SAFARI / ramulator2
View on GitHub
Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and …
☆596Jul 6, 2026Updated 2 weeks ago
yousei-github / ChampSim-Ramulator
View on GitHub
A simulator integrates ChampSim and Ramulator.
☆23Updated this week
PSAL-POSTECH / PyTorchSim
View on GitHub
PyTorchSim is a Comprehensive, Fast, and Accurate NPU Simulation Framework
☆131Updated this week
casys-kaist / oaken
View on GitHub
Artifact for Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization
☆17May 9, 2025Updated last year
SAITPublic / PIMSimulator
View on GitHub
Processing-In-Memory (PIM) Simulator
☆247Dec 12, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
darchr / dram-cache-model
View on GitHub
This is where gem5 based DRAM cache models live.
☆20Mar 23, 2023Updated 3 years ago
PrincetonUniversity / LLMCompass
View on GitHub
☆260Oct 24, 2025Updated 8 months ago
PSAL-POSTECH / M2NDP-public
View on GitHub
A Cycle-level simulator for M2NDP
☆40Aug 14, 2025Updated 11 months ago
leesou / H2-LLM-ISCA-2025
View on GitHub
H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference
☆113Apr 26, 2025Updated last year
VIA-Research / uPIMulator
View on GitHub
☆183Feb 1, 2025Updated last year
KFM135 / chiplet-optimizer
View on GitHub
This repository contains the code for this paper: Chiplet-Gym: An RL-based Optimization Framework for Chiplet-based AI Accelerator
☆22Sep 28, 2024Updated last year
godfather991 / UniNDP
View on GitHub
Artifact material for [HPCA 2025] #2108 "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures"
☆60Sep 1, 2025Updated 10 months ago
arkhadem / DX100
View on GitHub
Artifact for "DX100: A Programmable Data Access Accelerator for Indirection (ISCA 2025)" paper
☆19Nov 6, 2025Updated 8 months ago
leesou / PIM-DL-ASPLOS
View on GitHub
PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization
☆37Feb 21, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
IPADS-SAI / WaferAI-SIM
View on GitHub
The wafer-native AI accelerator simulation platform and inference engine.
☆57Jan 1, 2026Updated 6 months ago
ranggihwang / Pregated_MoE
View on GitHub
☆62May 4, 2024Updated 2 years ago
harrylee365 / pcmcsim_public
View on GitHub
PCMCsim: An Accurate Phase-Change Memory Controller Simulator and its Performance Analysis (ISPASS 2022)
☆10Aug 3, 2024Updated last year
sarchlab / triosim
View on GitHub
☆42Jul 2, 2026Updated 2 weeks ago
spcl / rapidchiplet
View on GitHub
A toolchain for rapid design space exploration of chiplet architectures
☆87Jul 25, 2025Updated 11 months ago
casys-kaist / NeuPIMs
View on GitHub
NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing
☆123Jun 19, 2024Updated 2 years ago
fangjh21 / PALM
View on GitHub
PALM: A Efficient Performance Simulator for Tiled Accelerators with Large-scale Model Training
☆21Jun 12, 2024Updated 2 years ago