Parallelized and vectorized SpMV on Intel Xeon Phi (Knights Landing, AVX512, KNL)
☆24Feb 12, 2024Updated 2 years ago
Alternatives and similar repositories for CVR
Users that are interested in CVR are comparing it to the libraries listed below
Sorting:
- A New Format for SIMD-accelerated SpMV☆22Apr 4, 2022Updated 3 years ago
- The SparseX sparse kernel optimization library☆43Jan 16, 2019Updated 7 years ago
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.☆29Mar 16, 2021Updated 5 years ago
- ☆98Feb 10, 2017Updated 9 years ago
- Sparse Matrix-Vector Multiplication implementations in C☆22Dec 7, 2022Updated 3 years ago
- CSR5-based SpMV on CPUs, GPUs and Xeon Phi☆110Jun 10, 2024Updated last year
- VNEC: A Vectorized Non-Empty Column Format for SpMV on cross-platform multicore CPUs☆10Feb 6, 2024Updated 2 years ago
- General, Hybrid and Optimized Sparse Toolkit (Bitbucket mirror)☆12Apr 8, 2021Updated 4 years ago
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format☆22Jun 8, 2018Updated 7 years ago
- 稀疏矩阵-向量乘的并行优化算法(OpenMP,AVX)☆11Jul 7, 2021Updated 4 years ago
- SpMV using CUDA☆20Mar 5, 2018Updated 8 years ago
- ☆34Feb 20, 2022Updated 4 years ago
- Artifact of paper "Exploiting Recent SIMD Architectural Advances for Irregular Applications"☆11Jun 23, 2016Updated 9 years ago
- A Vector Caching Scheme for Streaming FPGA SpMV Accelerators☆10Sep 7, 2015Updated 10 years ago
- QEMU 训练营教学文档☆26Nov 7, 2025Updated 4 months ago
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details☆25Sep 27, 2019Updated 6 years ago
- BigDataBench Spark workloads☆11Jul 15, 2016Updated 9 years ago
- Optimizations on Graph500☆10Jul 15, 2016Updated 9 years ago
- Generic exascale-ready library for halo-exchange operations on variety of grids/meshes☆10Updated this week
- Logger for MPI communication☆27Jul 12, 2023Updated 2 years ago
- ☆23Feb 5, 2026Updated last month
- A reinforcement learning algorithm for congestion control, together with a realistic Omnet++ network simulation environment☆36Jul 20, 2023Updated 2 years ago
- A portable and efficient infrastracture for value profilers. Doc: https://vclinic.readthedocs.io/en/latest/index.html☆14Mar 4, 2026Updated 2 weeks ago
- RDFS: an erasure code based cloud storage system☆38Jul 28, 2014Updated 11 years ago
- Parallel SpMV using CSR representation, built in CUDA☆14Jun 27, 2020Updated 5 years ago
- ☆12Jan 19, 2020Updated 6 years ago
- Python tools for NVIDIA Profiler☆21Dec 21, 2017Updated 8 years ago
- Simple and efficient memory pool is implemented with C++11.☆10Jun 2, 2022Updated 3 years ago
- associative floating point addition☆19Apr 30, 2024Updated last year
- FFT-accelerated inductance extractor for voxelized structures☆17Jan 2, 2020Updated 6 years ago
- SGEMM and DGEMM subroutines using AVX512F instructions.☆15May 22, 2022Updated 3 years ago
- Matlab mex wrappers to cuSPARSE (NVIDIA)☆11Dec 10, 2025Updated 3 months ago
- COSMOlogical General Relativity And (Perfect fluid | Particle) Hydrodynamics☆15Oct 21, 2019Updated 6 years ago
- Topic supervised non-negative matrix factorization with sparse matrices☆12Mar 24, 2020Updated 5 years ago
- Spen's Official OpenOCD Mirror (no pull requests)☆12Jan 27, 2020Updated 6 years ago
- This repository contains some tools to monitor the UNC_CBO_CACHE_LOOKUP event of the C-Boxes.☆12Oct 11, 2017Updated 8 years ago
- Port of GRChombo to AMReX - under development!☆12Updated this week
- Sparse-dense matrix-matrix multiplication on GPUs☆14Oct 15, 2018Updated 7 years ago
- CacheDirector - Sending Packets to the Right Slice by Exploiting Intel Last-Level Cache Addressing☆12Apr 29, 2019Updated 6 years ago