the-data-lab / deep-codegen
☆9Updated last year
Alternatives and similar repositories for deep-codegen:
Users that are interested in deep-codegen are comparing it to the libraries listed below
- This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…☆31Updated last year
- HyFiSS: A Hybrid Fidelity Stall-Aware Simulator for GPGPUs☆26Updated 3 months ago
- A highly-flexible GPU simulator for AMD GPUs.☆128Updated this week
- ☆129Updated 8 months ago
- Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025☆41Updated last week
- Performance Prediction Toolkit for GPUs☆36Updated 3 years ago
- NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing☆75Updated 9 months ago
- ☆60Updated 9 months ago
- ☆22Updated last month
- Sharing the codebase and steps for artifact evaluation/reproduction for MICRO 2024 paper☆9Updated 6 months ago
- A portable framework to map DFG (dataflow graph, representing an application) on spatial accelerators.☆36Updated 2 years ago
- ☆18Updated 11 months ago
- ☆100Updated 3 weeks ago
- ☆36Updated last year
- ☆10Updated 2 months ago
- UPMEM LLM Framework allows profiling PyTorch layers and functions and simulate those layers/functions with a given hardware profile.☆25Updated last month
- ArchExplorer: Microarchitecture Exploration Via Bottleneck Analysis☆31Updated last year
- A fast, accurate, and easy-to-integrate memory simulator that model memory system performance with bandwidth--latency curves.☆24Updated 3 weeks ago
- Horizontal Fusion☆22Updated 3 years ago
- A Cycle-level simulator for M2NDP☆25Updated 4 months ago
- ☆25Updated 3 years ago
- LLTFI is a tool, which is an extension of LLFI, allowing users to run fault injection experiments on C/C++, TensorFlow and PyTorch applic…☆36Updated 5 months ago
- mNPUsim: A Cycle-accurate Multi-core NPU Simulator (IISWC 2023)☆48Updated 3 months ago
- GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated…☆51Updated 2 weeks ago
- A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarkin…☆41Updated last year
- The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering☆45Updated 7 months ago
- ☆49Updated last week
- ☆25Updated 4 years ago
- ☆131Updated last month
- ☆69Updated 4 years ago