IBM/3D-CiM-LLM-Inference-Simulator

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/IBM/3D-CiM-LLM-Inference-Simulator)

IBM / 3D-CiM-LLM-Inference-Simulator

Simulator for LLM inference on an abstract 3D AIMC-based accelerator

☆30

Alternatives and similar repositories for 3D-CiM-LLM-Inference-Simulator

Users that are interested in 3D-CiM-LLM-Inference-Simulator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ECASLab / hls-fpga-accelerators
View on GitHub
Collection of kernel accelerators optimised for LLM execution
☆32Feb 26, 2026Updated 2 months ago
neurosim / DNN_NeuroSim_V1.4
View on GitHub
Benchmark framework of compute-in-memory based accelerators for deep neural network (inference engine focused)
☆81Mar 9, 2025Updated last year
RakeshUIUC / multihead_attn_accelerator
View on GitHub
Accelerate multihead attention transformer model using HLS for FPGA
☆12Dec 7, 2023Updated 2 years ago
godfather991 / UniNDP
View on GitHub
Artifact material for [HPCA 2025] #2108 "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures"
☆56Sep 1, 2025Updated 8 months ago
KULeuven-MICAS / zigzag-llm
View on GitHub
Model LLM inference on single-core dataflow accelerators
☆18Dec 16, 2025Updated 5 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
hplp / PiMulator
View on GitHub
Processing in Memory Emulation
☆27Feb 24, 2023Updated 3 years ago
shieldforever / NeuronQuant
View on GitHub
[ASP-DAC 2025] "NeuronQuant: Accurate and Efficient Post-Training Quantization for Spiking Neural Networks" Official Implementation
☆19Mar 6, 2025Updated last year
CMU-SAFARI / SiMRA-DRAM
View on GitHub
Source code & scripts for experimental characterization and demonstration of 1) simultaneous many-row activation, 2) up to nine-input maj…
☆12May 17, 2024Updated 2 years ago
kabazoka / ViT-Accelerator
View on GitHub
（Not actively updating）Vision Transformer Accelerator implemented in Vivado HLS for Xilinx FPGAs.
☆20Dec 29, 2024Updated last year
adervay1 / CIMulator
View on GitHub
a Computing In Memory emULATOR framework
☆15May 19, 2024Updated 2 years ago
booniebears / CoMN
View on GitHub
☆18May 1, 2024Updated 2 years ago
upmem / upmem_llm_framework
View on GitHub
UPMEM LLM Framework allows profiling PyTorch layers and functions and simulate those layers/functions with a given hardware profile.
☆41Apr 8, 2026Updated last month
zondie17 / UCAS_HPCS
View on GitHub
UCAS High Performance Computing System 国科大高性能计算系统复习及试题
☆16May 27, 2022Updated 3 years ago
Accelergy-Project / processing-in-memory-design
View on GitHub
☆24Apr 20, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
CMU-SAFARI / pLUTo
View on GitHub
pLUTo is a DRAM-based Processing-using-Memory architecture that leverages the high density of DRAM to enable the massively parallel stori…
☆18Jan 12, 2023Updated 3 years ago
zhengchen3 / HLS_Transformer
View on GitHub
c++ version of ViT
☆12Nov 13, 2022Updated 3 years ago
neurosim / 3D_NeuroSim_V1.0
View on GitHub
Benchmark framework of 3D integrated CIM accelerators for popular DNN inference, support both monolithic and heterogeneous 3D integration
☆27Sep 21, 2021Updated 4 years ago
SEAL-UCSB / NVSim
View on GitHub
NVSim - A performance, energy and area estimation tool for non-volatile memory (NVM)
☆139Aug 27, 2018Updated 7 years ago
githubofaliyev / SNN-DSE
View on GitHub
Hardware and software implementation of Sparsely-active SNNs
☆22Mar 6, 2026Updated 2 months ago
iankur / vqllm
View on GitHub
Residual vector quantization for KV cache compression in large language model
☆12Oct 22, 2024Updated last year
CRAFT-THU / XB-Sim
View on GitHub
A Unified Framework for Training, Mapping and Simulation of ReRAM-Based Convolutional Neural Network Acceleration
☆37May 19, 2022Updated 4 years ago
gkrish19 / SIAM
View on GitHub
Scalable In-Memory Acceleration With Mesh: Device, Circuits, Architecture, and Algorithm
☆16Oct 11, 2020Updated 5 years ago
maestro-project / magma
View on GitHub
☆19Jun 17, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
elliothe / pytorx
View on GitHub
Neural Network Evaluation Tool on Crossbar-based Accelerator with Resistive Memory
☆43Oct 30, 2019Updated 6 years ago
CMU-SAFARI / MIMDRAM
View on GitHub
Source code for the architectural simulator used for modeling the PUD system proposed in our HPCA 2024 paper `MIMDRAM: An End-to-End Proc…
☆29Sep 12, 2025Updated 8 months ago
smpanaro / apple-silicon-4bit-quant
View on GitHub
Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"
☆11Mar 31, 2024Updated 2 years ago
mit-emze / cimloop
View on GitHub
☆96Jan 4, 2026Updated 4 months ago
KULeuven-MICAS / zigzag-imc
View on GitHub
HW accelerator mapping optimization framework for in-memory computing
☆30Jun 3, 2025Updated 11 months ago
jgoeders / dac_sdc_2021_designs
View on GitHub
☆19Mar 16, 2022Updated 4 years ago
pulp-platform / rbe
View on GitHub
Reconfigurable Binary Engine
☆17Mar 23, 2021Updated 5 years ago
thu-nics / MNSIM-2.0
View on GitHub
A Behavior-Level Modeling Tool for Memristor-based Neuromorphic Computing Systems
☆201Nov 27, 2024Updated last year
adamgallas / FireFly-v1
View on GitHub
[TVLSI'23] This repository contains the source code for the paper "FireFly: A High-Throughput Hardware Accelerator for Spiking Neural Net…
☆24Apr 4, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
IBM / p-kit
View on GitHub
Python library for the simulation of probabilistic circuits.
☆13May 9, 2026Updated last week
CristianCosci / Ant_Colony_Optimization_for_OSSP
View on GitHub
Open Shop Scheduling Problem resolution via Ant Colony Optimization algorithm.
☆14Mar 28, 2023Updated 3 years ago
DingdongD / SIMFNet
View on GitHub
A novel multi-domain fusion network based on sample intercorrelation (SIMFNet) is proposed. The first mulitvariate aquatic human activity…
☆29Dec 25, 2023Updated 2 years ago
CMU-SAFARI / pim-ml
View on GitHub
PIM-ML is a benchmark for training machine learning algorithms on the UPMEM architecture, which is the first publicly-available real-worl…
☆25Jan 7, 2025Updated last year
IBM / aihwkit
View on GitHub
IBM Analog Hardware Acceleration Kit
☆482Updated this week
prabdeb / openai-iot-speech-chatbot
View on GitHub
OpenAI GPT model to build your personal assistant in IoT devices. Just like Alexa, Google Assistant, Siri, etc. but with your own skills,…
☆12Aug 7, 2023Updated 2 years ago
takahiro-hirofuchi / mesmeric-emulator
View on GitHub
MESMERIC: A Software-based NVM Emulator Supporting Read/Write Asymmetric Latencies
☆10Oct 1, 2020Updated 5 years ago