wang-luhan / VNECLinks
VNEC: A Vectorized Non-Empty Column Format for SpMV on cross-platform multicore CPUs
☆10Updated last year
Alternatives and similar repositories for VNEC
Users that are interested in VNEC are comparing it to the libraries listed below
Sorting:
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.☆29Updated 4 years ago
- ☆78Updated 4 years ago
- Performance Prediction Toolkit for GPUs☆37Updated 3 years ago
- Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025☆100Updated 5 months ago
- A highly-flexible GPU simulator for AMD GPUs.☆193Updated this week
- A Shared Memory Multithreaded Graph Benchmark Suite for Multicores☆36Updated 5 months ago
- Rodinia benchmark☆189Updated 2 years ago
- ☆12Updated 3 years ago
- PolyBench/C benchmark suite (version 4.2.1 beta) from http://web.cse.ohio-state.edu/~pouchet/software/polybench/☆119Updated 9 years ago
- ngAP's artifact for ASPLOS'24☆24Updated 3 months ago
- HyFiSS: A Hybrid Fidelity Stall-Aware Simulator for GPGPUs☆37Updated 10 months ago
- The Sniper Multi-Core Simulator☆154Updated 2 weeks ago
- A speculative mechanism to accelerate long-latency off-chip load requests by removing on-chip cache access latency from their critical pa…☆75Updated last month
- Source code of the simulator used in the Mosaic paper from MICRO 2017: "Mosaic: A GPU Memory Manager with Application-Transparent Support…☆49Updated 7 years ago
- Simulator code of the paper "Dissecting and Modeling the Architecture of Modern GPU Cores"☆35Updated 2 weeks ago
- GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated…☆67Updated this week
- ArchExplorer: Microarchitecture Exploration Via Bottleneck Analysis☆34Updated last year
- The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering☆59Updated last year
- This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…☆36Updated 2 years ago
- ☆21Updated this week
- ☆30Updated 5 years ago
- A fast and flexible simulation infrastructure for exploring general-purpose processing-in-memory (PIM) architectures. Ramulator-PIM combi…☆179Updated 3 years ago
- A flexible, high-performance, user-friendly computer architecture simulator engine☆90Updated last week
- Github repository of HPCA 2025 paper "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures"☆15Updated 2 months ago
- Horizontal Fusion☆24Updated 3 years ago
- The simulator for SPADA, an SpGEMM accelerator with adaptive dataflow☆39Updated 2 years ago
- A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs☆26Updated last year
- A fast, accurate, and easy-to-integrate memory simulator that model memory system performance with bandwidth--latency curves.☆29Updated 2 weeks ago
- A pattern-based algorithmic autotuner for graph processing on GPUs.☆31Updated 4 months ago
- PyGim is the first runtime framework to efficiently execute Graph Neural Networks (GNNs) on real Processing-in-Memory systems. It provide…☆31Updated 6 months ago