Simulator for LLM inference on an abstract 3D AIMC-based accelerator
☆25Sep 18, 2025Updated 5 months ago
Alternatives and similar repositories for 3D-CiM-LLM-Inference-Simulator
Users that are interested in 3D-CiM-LLM-Inference-Simulator are comparing it to the libraries listed below
Sorting:
- Artifact material for [HPCA 2025] #2108 "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures"☆53Sep 1, 2025Updated 6 months ago
- Collection of kernel accelerators optimised for LLM execution☆27Nov 19, 2025Updated 3 months ago
- Accelerate multihead attention transformer model using HLS for FPGA☆11Dec 7, 2023Updated 2 years ago
- Benchmark framework of compute-in-memory based accelerators for deep neural network (inference engine focused)☆76Mar 9, 2025Updated 11 months ago
- [ASP-DAC 2025] "NeuronQuant: Accurate and Efficient Post-Training Quantization for Spiking Neural Networks" Official Implementation☆15Mar 6, 2025Updated 11 months ago
- ☆18May 1, 2024Updated last year
- ☆24Apr 20, 2024Updated last year
- UPMEM LLM Framework allows profiling PyTorch layers and functions and simulate those layers/functions with a given hardware profile.☆37Aug 6, 2025Updated 6 months ago
- (Not actively updating)Vision Transformer Accelerator implemented in Vivado HLS for Xilinx FPGAs.☆19Dec 29, 2024Updated last year
- A Unified Framework for Training, Mapping and Simulation of ReRAM-Based Convolutional Neural Network Acceleration☆36May 19, 2022Updated 3 years ago
- UCAS High Performance Computing System 国科大高性能计算系统复习及试题☆16May 27, 2022Updated 3 years ago
- pLUTo is a DRAM-based Processing-using-Memory architecture that leverages the high density of DRAM to enable the massively parallel stori…☆18Jan 12, 2023Updated 3 years ago
- [ASPLOS 2024] CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators☆45May 25, 2024Updated last year
- Benchmark framework of 3D integrated CIM accelerators for popular DNN inference, support both monolithic and heterogeneous 3D integration☆26Sep 21, 2021Updated 4 years ago
- ARTICo³ - Dynamic and Partially Reconfigurable Architecture for Run-Time Adaptive, High Performance Embedded Computing☆12Sep 10, 2024Updated last year
- ☆83Jan 4, 2026Updated last month
- [ASPLOS 2019] PUMA-simulator provides a detailed simulation model of a dataflow architecture built with NVM (non-volatile memory), and ru…☆67Apr 17, 2023Updated 2 years ago
- SimplePIM is the first high-level programming framework for real-world processing-in-memory (PIM) architectures. Described in the PACT 20…☆31Oct 23, 2023Updated 2 years ago
- A simulator for SK hynix AiM PIM architecture based on Ramulator 2.0☆59Jul 22, 2025Updated 7 months ago
- ☆37Jun 23, 2025Updated 8 months ago
- NVSim - A performance, energy and area estimation tool for non-volatile memory (NVM)☆134Aug 27, 2018Updated 7 years ago
- ☆11Mar 14, 2023Updated 2 years ago
- CrossSim: accuracy simulation of analog in-memory computing☆196Mar 26, 2025Updated 11 months ago
- PIMeval simulator and PIMbench suite☆44Nov 22, 2025Updated 3 months ago
- A scheduler to manage a multi tool dual arm robot while avoiding arm-to-arm collisions; considering complex side constraints; and optimiz…☆11Jul 6, 2021Updated 4 years ago
- The simulator for SPADA, an SpGEMM accelerator with adaptive dataflow☆47Jan 26, 2023Updated 3 years ago
- 基于FPGA的FFT算法并行优化☆12Mar 7, 2024Updated last year
- A GPU accelerated library for computing rigid body dynamics with analytical gradients☆13Feb 8, 2026Updated 2 weeks ago
- Debiasing Through Data Attribution☆12May 23, 2024Updated last year
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated last year
- ☆133Jun 24, 2024Updated last year
- Neural Network Evaluation Tool on Crossbar-based Accelerator with Resistive Memory☆43Oct 30, 2019Updated 6 years ago
- Direction Of Arrival (DOA) simulation tools with Nec2☆13Apr 24, 2017Updated 8 years ago
- ☆11Apr 5, 2023Updated 2 years ago
- NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing☆107Jun 19, 2024Updated last year
- A lightweight implementation of MPC and NMPC in C++ using Eigen3☆10Oct 27, 2023Updated 2 years ago
- Residual vector quantization for KV cache compression in large language model☆11Oct 22, 2024Updated last year
- Kinodyanmic Parallel Accelerated eXpansion☆13Sep 9, 2024Updated last year
- Uber Autonomous Visualization System☆12Jan 4, 2023Updated 3 years ago