Parallelized and vectorized SpMV on Intel Xeon Phi (Knights Landing, AVX512, KNL)
☆24 · Feb 12, 2024 · Updated 2 years ago
Alternatives and similar repositories for CVR
Users that are interested in CVR are comparing it to the libraries listed below.
- This is a tuned sparse matrix dense vector multiplication (SpMV) library ☆23 · Mar 21, 2016 · Updated 10 years ago
- A New Format for SIMD-accelerated SpMV ☆22 · Apr 4, 2022 · Updated 4 years ago
- The SparseX sparse kernel optimization library ☆43 · Jan 16, 2019 · Updated 7 years ago
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21. ☆29 · Mar 16, 2021 · Updated 5 years ago
- ☆99 · Feb 10, 2017 · Updated 9 years ago
- Sparse Matrix-Vector Multiplication implementations in C ☆22 · Dec 7, 2022 · Updated 3 years ago
- CSR5-based SpMV on CPUs, GPUs and Xeon Phi ☆111 · Jun 10, 2024 · Updated 2 years ago
- VNEC: A Vectorized Non-Empty Column Format for SpMV on cross-platform multicore CPUs ☆10 · Feb 6, 2024 · Updated 2 years ago
- General, Hybrid and Optimized Sparse Toolkit (Bitbucket mirror) ☆12 · Apr 8, 2021 · Updated 5 years ago
- OpenGraph is an open-source graph processing benchmarking suite written in pure C/OpenMP. ☆14 · Apr 27, 2024 · Updated 2 years ago
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format ☆22 · Jun 8, 2018 · Updated 8 years ago
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang… ☆13 · Aug 12, 2022 · Updated 3 years ago
- Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding ☆17 · Oct 20, 2021 · Updated 4 years ago
- SpMV using CUDA ☆20 · Mar 5, 2018 · Updated 8 years ago
- Artifact for PPoPP 2018 paper "Making Pull-Based Graph Processing Performant" ☆23 · Apr 23, 2020 · Updated 6 years ago
- ☆34 · Feb 20, 2022 · Updated 4 years ago
- Artifact of paper "Exploiting Recent SIMD Architectural Advances for Irregular Applications" ☆11 · Jun 23, 2016 · Updated 9 years ago
- AutoRNP -- Automated Repair of High Floating-Point Errors in Numerical Libraries ☆12 · Dec 28, 2018 · Updated 7 years ago
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details ☆25 · Sep 27, 2019 · Updated 6 years ago
- BigDataBench Spark workloads ☆11 · Jul 15, 2016 · Updated 9 years ago
- Optimizations on Graph500 ☆10 · Jul 15, 2016 · Updated 9 years ago
- Generic exascale-ready library for halo-exchange operations on a variety of grids/meshes ☆11 · May 28, 2026 · Updated last week
- Logger for MPI communication ☆28 · Jul 12, 2023 · Updated 2 years ago
- SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) ar… ☆80 · Jun 29, 2022 · Updated 3 years ago
- ☆14 · Sep 22, 2019 · Updated 6 years ago
- A reinforcement learning algorithm for congestion control, together with a realistic Omnet++ network simulation environment ☆36 · Jul 20, 2023 · Updated 2 years ago
- A portable and efficient infrastructure for value profilers. Doc: https://vclinic.readthedocs.io/en/latest/index.html ☆14 · Mar 4, 2026 · Updated 3 months ago
- How to call NVTX from Fortran ☆13 · Jun 25, 2025 · Updated 11 months ago
- RDFS: an erasure code based cloud storage system ☆38 · Jul 28, 2014 · Updated 11 years ago
- Parallel SpMV using CSR representation, built in CUDA ☆14 · Jun 27, 2020 · Updated 5 years ago
- A simple and efficient memory pool implemented in C++11. ☆10 · Jun 2, 2022 · Updated 4 years ago
- Python tools for NVIDIA Profiler ☆21 · Dec 21, 2017 · Updated 8 years ago
- associative floating point addition ☆19 · Apr 30, 2024 · Updated 2 years ago
- FFT-accelerated inductance extractor for voxelized structures ☆18 · Jan 2, 2020 · Updated 6 years ago
- Matlab mex wrappers to cuSPARSE (NVIDIA) ☆11 · Dec 10, 2025 · Updated 6 months ago
- The official website of One Student One Chip project. ☆12 · Feb 5, 2026 · Updated 4 months ago
- This repository contains some tools to monitor the UNC_CBO_CACHE_LOOKUP event of the C-Boxes. ☆12 · Oct 11, 2017 · Updated 8 years ago
- Vectorized implementations of hash join algorithms on Intel Xeon Phi (KNL) ☆15 · Feb 3, 2018 · Updated 8 years ago
- Concurrent Log-Structured Memory for Many-Core Key-Value Stores ☆36 · Jul 7, 2020 · Updated 5 years ago