PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization
☆36Feb 21, 2024Updated 2 years ago
Alternatives and similar repositories for PIM-DL-ASPLOS
Users that are interested in PIM-DL-ASPLOS are comparing it to the libraries listed below
Sorting:
- [VLDB'23] A Skew-Resistant Index for Processing-in-Memory☆27Jan 5, 2026Updated 2 months ago
- UPMEM LLM Framework allows profiling PyTorch layers and functions and simulate those layers/functions with a given hardware profile.☆40Aug 6, 2025Updated 7 months ago
- ☆28Nov 29, 2024Updated last year
- ☆17Jul 24, 2023Updated 2 years ago
- Processing-In-Memory (PIM) Simulator☆225Dec 12, 2024Updated last year
- Examples of DPU programs using the UPMEM DPU SDK☆47Jan 30, 2025Updated last year
- ☆17Jun 4, 2025Updated 9 months ago
- PrIM (Processing-In-Memory benchmarks) is the first benchmark suite for a real-world processing-in-memory (PIM) architecture. PrIM is dev…☆169Apr 29, 2024Updated last year
- PIM-ML is a benchmark for training machine learning algorithms on the UPMEM architecture, which is the first publicly-available real-worl…☆25Jan 7, 2025Updated last year
- ☆139Jun 24, 2024Updated last year
- ☆11May 8, 2025Updated 10 months ago
- This repository presents the source code for the paper "MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Qu…☆23Apr 2, 2025Updated 11 months ago
- [ASPLOS 2024] CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators☆45May 25, 2024Updated last year
- ☆41Feb 23, 2026Updated 3 weeks ago
- Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025☆129May 3, 2025Updated 10 months ago
- NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing☆109Jun 19, 2024Updated last year
- ☆23May 30, 2025Updated 9 months ago
- ☆20Jun 1, 2023Updated 2 years ago
- TransPimLib is a library for transcendental (and other hard-to-calculate) functions in general-purpose PIM systems, TransPimLib provides …☆15Apr 21, 2023Updated 2 years ago
- ☆20Sep 28, 2024Updated last year
- Processing in Memory Emulation☆24Feb 24, 2023Updated 3 years ago
- ☆169Feb 1, 2025Updated last year
- GNNear: Accelerating Full-Batch Training of Graph NeuralNetworks with Near-Memory Processing☆16Sep 15, 2022Updated 3 years ago
- SimplePIM is the first high-level programming framework for real-world processing-in-memory (PIM) architectures. Described in the PACT 20…☆31Oct 23, 2023Updated 2 years ago
- Artifact for "DX100: A Programmable Data Access Accelerator for Indirection (ISCA 2025)" paper☆17Nov 6, 2025Updated 4 months ago
- The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering☆63Aug 11, 2024Updated last year
- PyGim is the first runtime framework to efficiently execute Graph Neural Networks (GNNs) on real Processing-in-Memory systems. It provide…☆33Apr 23, 2025Updated 10 months ago
- A Behavior-Level Modeling Tool for Memristor-based Neuromorphic Computing Systems☆200Nov 27, 2024Updated last year
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference☆19Mar 5, 2023Updated 3 years ago
- [DATE 2023] Pipe-BD: Pipelined Parallel Blockwise Distillation☆12Jul 13, 2023Updated 2 years ago
- A fast, accurate, and easy-to-integrate memory simulator that model memory system performance with bandwidth--latency curves.☆32Oct 18, 2025Updated 5 months ago
- ☆13Mar 6, 2023Updated 3 years ago
- H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference☆93Apr 26, 2025Updated 10 months ago
- [ASPLOS 2019] PUMA-simulator provides a detailed simulation model of a dataflow architecture built with NVM (non-volatile memory), and ru…☆67Apr 17, 2023Updated 2 years ago
- PIMeval simulator and PIMbench suite☆46Nov 22, 2025Updated 3 months ago
- This repository contains an extended version of SMCSim (originally by Erfan Azarkhish), used for near-data-processing research by Jiwon C…☆14Nov 24, 2020Updated 5 years ago
- Residual vector quantization for KV cache compression in large language model☆12Oct 22, 2024Updated last year
- A Full-System Framework for Simulating NDP devices from Caches to DRAM☆21Jan 12, 2024Updated 2 years ago
- A Scalable BFS Accelerator on FPGA-HBM Platform☆15Feb 22, 2024Updated 2 years ago