PrincetonUniversity/LLMCompass

☆260

Alternatives and similar repositories for LLMCompass

Users who are interested in LLMCompass are comparing it to the libraries listed below.

scalesim-project / SCALE-Sim
Repository to host and maintain SCALE-Sim code
☆498 · Jun 28, 2026 · Updated 3 weeks ago
leesou / H2-LLM-ISCA-2025
H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference
☆112 · Apr 26, 2025 · Updated last year
casys-kaist / NeuPIMs
NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing
☆123 · Jun 19, 2024 · Updated 2 years ago
PSAL-POSTECH / ONNXim
ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference
☆209 · Jan 8, 2026 · Updated 6 months ago
SAITPublic / PIMSimulator
Processing-In-Memory (PIM) Simulator
☆245 · Dec 12, 2024 · Updated last year
accel-sim / accel-sim-framework
This is the top-level repository for the Accel-Sim framework.
☆625 · Mar 24, 2026 · Updated 3 months ago
casys-kaist / LLMServingSim
LLMServingSim 2.0: A Unified Simulator for Heterogeneous and Disaggregated LLM Serving Infrastructure
☆340 · Updated this week
NVlabs / timeloop
Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures.
☆504 · Jun 22, 2026 · Updated 3 weeks ago
mit-han-lab / spatten
[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
☆136 · Aug 27, 2024 · Updated last year
maestro-project / maestro
An analytical cost model evaluating DNN mappings (dataflows and tiling).
☆258 · Apr 15, 2024 · Updated 2 years ago
PKUZHOU / GNNear-PACT-2022
GNNear: Accelerating Full-Batch Training of Graph Neural Networks with Near-Memory Processing
☆17 · Sep 15, 2022 · Updated 3 years ago
scale-snu / attacc_simulator
☆158 · Jun 24, 2024 · Updated 2 years ago
astra-sim / astra-sim
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale
☆641 · Apr 25, 2026 · Updated 2 months ago
Yufeng98 / CENT
Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025
☆141 · May 3, 2025 · Updated last year
SET-Scheduling-Project / GEMINI-HPCA2024
Open-source Framework for HPCA2024 paper: Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators
☆116 · Apr 28, 2025 · Updated last year
diwu1990 / uSystolic-Sim
A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022.
☆84 · Nov 7, 2021 · Updated 4 years ago
PrincetonUniversity / muchiSim
Simulator framework for analysis of performance, energy consumption, area and cost of multi-node multi-chiplet tile-based manycore design…
☆76 · Jun 30, 2024 · Updated 2 years ago
ConvolutedDog / gpgpu-sim-comments
GPGPU-Sim code with Chinese comments: contains the latest version of the GPGPU-Sim simulator code, annotated in Chinese to help Chinese-speaking users better understand and use the simulator.
☆30 · Dec 18, 2024 · Updated last year
suchandler96 / gem5-NVDLA
☆45 · Mar 31, 2025 · Updated last year
GATECH-EIC / ViTCoD
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆133 · Jun 27, 2023 · Updated 3 years ago
harvard-acc / EdgeBERT
HW/SW co-design of sentence-level energy optimizations for latency-aware multi-task NLP inference
☆54 · Mar 24, 2024 · Updated 2 years ago
ConvolutedDog / HyFiSS
HyFiSS: A Hybrid Fidelity Stall-Aware Simulator for GPGPUs
☆42 · Dec 9, 2024 · Updated last year
hahnyuan / LLM-Viewer
Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline mod…
☆661 · Sep 11, 2024 · Updated last year
clevercool / ANT-Quantization
☆123 · Nov 17, 2023 · Updated 2 years ago
abhibambhaniya / GenZ-LLM-Analyzer
LLM Inference analyzer for different hardware platforms
☆119 · Jun 23, 2026 · Updated 3 weeks ago
ChaseLab-PKU / InstAttention
InstAttention: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference
☆18 · Mar 30, 2025 · Updated last year
arkhadem / aim_simulator
A simulator for SK hynix AiM PIM architecture based on Ramulator 2.0
☆69 · Jul 22, 2025 · Updated 11 months ago
ucb-bar / cosa
A scheduler for spatial DNN accelerators that generates high-performance schedules in one shot using mixed integer programming (MIP)
☆85 · Aug 28, 2023 · Updated 2 years ago
CLab-HKUST-GZ / micro58-axcore
☆41 · Oct 21, 2025 · Updated 8 months ago
mutinifni / splitwise-sim
LLM serving cluster simulator
☆157 · Apr 25, 2024 · Updated 2 years ago
microsoft / vidur
Accurate, large-scale, and extensible simulator for LLM inference Systems
☆642 · Jul 25, 2025 · Updated 11 months ago
PrincetonUniversity / ttm-cas
☆16 · Oct 15, 2023 · Updated 2 years ago
arkhadem / DX100
Artifact for "DX100: A Programmable Data Access Accelerator for Indirection (ISCA 2025)" paper
☆19 · Nov 6, 2025 · Updated 8 months ago
Accelergy-Project / micro22-sparseloop-artifact
MICRO22 artifact evaluation for Sparseloop
☆48 · Aug 8, 2022 · Updated 3 years ago
pku-liang / Sanger
A co-design architecture on sparse attention
☆55 · Aug 23, 2021 · Updated 4 years ago
umd-memsys / DRAMsim3
DRAMsim3: a Cycle-accurate, Thermal-Capable DRAM Simulator
☆492 · Aug 3, 2024 · Updated last year
ARM-software / SCALE-Sim
☆385 · May 11, 2023 · Updated 3 years ago
SET-Scheduling-Project / SET-ISCA2023
The framework for the paper "Inter-layer Scheduling Space Definition and Exploration for Tiled Accelerators" in ISCA 2023.
☆83 · Mar 12, 2025 · Updated last year
godfather991 / UniNDP
Artifact material for [HPCA 2025] #2108 "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures"
☆60 · Sep 1, 2025 · Updated 10 months ago