A Cycle-level simulator for M2NDP
☆36Aug 14, 2025Updated 7 months ago
Alternatives and similar repositories for M2NDP-public
Users that are interested in M2NDP-public are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jul 2, 2024Updated last year
- PyTorchSim is a Comprehensive, Fast, and Accurate NPU Simulation Framework☆100Updated this week
- ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference☆191Jan 8, 2026Updated 2 months ago
- Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025☆129May 3, 2025Updated 10 months ago
- The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering☆63Aug 11, 2024Updated last year
- Processing in Memory Emulation☆24Feb 24, 2023Updated 3 years ago
- A full-system, cycle-level simulator based on gem5 that provides complete support for all three CXL sub-protocols and all three types of …☆135Mar 4, 2026Updated 3 weeks ago
- (elastic) cuckoo hashing☆16Jun 20, 2020Updated 5 years ago
- ☆169Feb 1, 2025Updated last year
- A Full-System Simulator for CXL-Based SSD Memory System☆39Dec 24, 2024Updated last year
- ☆140Jun 24, 2024Updated last year
- PIMeval simulator and PIMbench suite☆46Nov 22, 2025Updated 4 months ago
- Artifact for "DX100: A Programmable Data Access Accelerator for Indirection (ISCA 2025)" paper☆17Nov 6, 2025Updated 4 months ago
- The source code for GPGPUSim+Ramulator simulator. In this version, GPGPUSim uses Ramulator to simulate the DRAM. This simulator is used t…☆60Sep 30, 2019Updated 6 years ago
- Processing-In-Memory (PIM) Simulator☆225Dec 12, 2024Updated last year
- UPMEM LLM Framework allows profiling PyTorch layers and functions and simulate those layers/functions with a given hardware profile.☆40Aug 6, 2025Updated 7 months ago
- Overcoming the IOTLB Wall for Multi-100-Gbps Linux-based Networking☆24May 16, 2023Updated 2 years ago
- Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and …☆517Feb 4, 2026Updated last month
- Clio, ASPLOS'22.☆79Feb 8, 2022Updated 4 years ago
- ☆28Nov 29, 2024Updated last year
- A fast and flexible simulation infrastructure for exploring general-purpose processing-in-memory (PIM) architectures. Ramulator-PIM combi…☆182Oct 1, 2022Updated 3 years ago
- Johnny Cache: the End of DRAM Cache Conflicts (in Tiered Main Memory Systems)☆20Aug 2, 2023Updated 2 years ago
- LIBRA: Enabling Workload-aware Multi-dimensional Network Topology Optimization for Distributed Training of Large AI Models☆12May 7, 2024Updated last year
- Fast and accurate DRAM power and energy estimation tool☆192Updated this week
- ☆14Oct 30, 2024Updated last year
- ☆10Feb 10, 2025Updated last year
- NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing☆109Jun 19, 2024Updated last year
- ☆39Oct 14, 2025Updated 5 months ago
- Virtuoso is a fast, accurate and versatile simulation framework designed for virtual memory research. Virtuoso uses a new simulation met…☆85Updated this week
- Artifact material for [HPCA 2025] #2108 "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures"☆53Sep 1, 2025Updated 6 months ago
- RPCNIC: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator [HPCA2025]☆14Dec 9, 2024Updated last year
- PrIM (Processing-In-Memory benchmarks) is the first benchmark suite for a real-world processing-in-memory (PIM) architecture. PrIM is dev…☆169Apr 29, 2024Updated last year
- [PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…☆10Aug 13, 2024Updated last year
- mNPUsim: A Cycle-accurate Multi-core NPU Simulator (IISWC 2023)☆72Dec 29, 2025Updated 2 months ago
- My solutions to Udacity's Parallel Programming course (CS 344)☆10Jul 12, 2016Updated 9 years ago
- Cost Model☆19Apr 11, 2025Updated 11 months ago
- [IJCAI 2024] QiMeng-CPU-v1: Automated CPU Design by Learning from Input-Output Examples☆27May 4, 2025Updated 10 months ago
- ☆14Mar 10, 2024Updated 2 years ago
- High-Performance KV Cache Storage Engine on CXL Shared Memory for LLM Inference☆45Updated this week