LucasWilkinson/ASpT-mirror

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LucasWilkinson/ASpT-mirror)

LucasWilkinson / ASpT-mirror

Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding

☆17

Alternatives and similar repositories for ASpT-mirror

Users that are interested in ASpT-mirror are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ParCIS / Magicube
View on GitHub
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
☆92Nov 23, 2022Updated 3 years ago
CRAFT-THU / RoDe
View on GitHub
A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs
☆30Nov 29, 2023Updated 2 years ago
hgyhungry / ge-spmm
View on GitHub
☆115Jul 3, 2021Updated 5 years ago
SuperScientificSoftwareLaboratory / TileSpMV
View on GitHub
Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…
☆13Aug 12, 2022Updated 3 years ago
owensgroup / merge-spmm
View on GitHub
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
☆74Oct 5, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lenLRX / AmpereSparseMatmul
View on GitHub
study of Ampere' Sparse Matmul
☆18Jan 10, 2021Updated 5 years ago
YusukeNagasaka / Batched-SpMM
View on GitHub
New batched algorithm for sparse matrix-matrix multiplication (SpMM)
☆16May 7, 2019Updated 7 years ago
abhinav-vaishya / Fast-Training-of-Convolutional-Networks-through-FFTs
View on GitHub
Implementation of the paper - Fast Training of Convolutional Networks through FFTs (CUDA for parallelization)
☆10May 8, 2020Updated 6 years ago
SuperScientificSoftwareLaboratory / DASP
View on GitHub
Source code of the SC '23 paper: "DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multipli…
☆29Jun 18, 2024Updated 2 years ago
SpRegTiling / sparse-register-tiling
View on GitHub
☆10Mar 2, 2024Updated 2 years ago
GPUPeople / spECK
View on GitHub
Efficient SpGEMM on GPU using CUDA and CSR
☆61Jul 18, 2023Updated 3 years ago
md2z34 / winograd_gpu
View on GitHub
GPU implementation of Winograd convolution
☆10Oct 23, 2017Updated 8 years ago
dgSPARSE / dgSPARSE-Lib
View on GitHub
PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity
☆122Jul 13, 2026Updated last week
Bruce-Lee-LY / cuda_back2back_hgemm
View on GitHub
Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.
☆13Nov 3, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
puckbee / CVR
View on GitHub
Parallelized and vectorized SpMV on Intel Xeon Phi (Knights Landing, AVX512, KNL)
☆24Feb 12, 2024Updated 2 years ago
Sable / fait-maison-spmv
View on GitHub
Sparse Matrix-Vector Multiplication implementations in C
☆22Dec 7, 2022Updated 3 years ago
oresths / tSparse
View on GitHub
A GPU algorithm for sparse matrix-matrix multiplication
☆74Oct 1, 2020Updated 5 years ago
HPMLL / SpInfer_EuroSys25
View on GitHub
☆35Apr 2, 2025Updated last year
weifengliu-ssslab / Benchmark_SpMV_using_CSR5
View on GitHub
CSR5-based SpMV on CPUs, GPUs and Xeon Phi
☆111Jun 10, 2024Updated 2 years ago
SuperScientificSoftwareLaboratory / TileSpGEMM
View on GitHub
Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…
☆48May 22, 2024Updated 2 years ago
google-research / sputnik
View on GitHub
A library of GPU kernels for sparse matrix operations.
☆289Nov 24, 2020Updated 5 years ago
lixiuhong / implicit_gemm_convolution
View on GitHub
☆14May 28, 2019Updated 7 years ago
microsoft / ConvStencil
View on GitHub
☆37Apr 10, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Qwesh157 / conv_op_optimization
View on GitHub
This project is about convolution operator optimization on GPU, include GEMM based (Implicit GEMM) convolution.
☆44Sep 29, 2025Updated 9 months ago
ROCm / rocBLAS-Examples
View on GitHub
Examples illustrating usage of the rocBLAS library
☆17Aug 12, 2024Updated last year
AnonymousRepo123 / AlphaSparse
View on GitHub
A intelligent matrix format designer for SpMV
☆10Oct 10, 2023Updated 2 years ago
poojahira / spmv-cuda
View on GitHub
Implementation and analysis of five different GPU based SPMV algorithms in CUDA
☆39Feb 5, 2019Updated 7 years ago
gevtushenko / block_matrix_format_performance
View on GitHub
☆12Jan 19, 2020Updated 6 years ago
cslab-ntua / artificial-matrix-generator
View on GitHub
An artificial matrix generator in C
☆13Feb 16, 2023Updated 3 years ago
StephanPreibisch / FourierConvolutionCUDALib
View on GitHub
Implementation of 3d non-separable convolution using CUDA & FFT Convolution
☆20Jan 15, 2019Updated 7 years ago
YulhwaKim / cutlass_tilesparse
View on GitHub
CUDA templates for tile-sparse matrix multiplication based on CUTLASS.
☆52Mar 1, 2018Updated 8 years ago
Luca-Dalmasso / matrixTransposeCUDA
View on GitHub
CUDA C simple application for Nvidia's GPU
☆11Jun 7, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Bruce-Lee-LY / memory_pool
View on GitHub
Simple and efficient memory pool is implemented with C++11.
☆10Jun 2, 2022Updated 4 years ago
Ivanrs297 / cuda-spmv-csr
View on GitHub
Parallel SpMV using CSR representation, built in CUDA
☆14Jun 27, 2020Updated 6 years ago
araij / rabbit_order
View on GitHub
☆49Jan 30, 2026Updated 5 months ago
wjc404 / GEMM_AVX512F
View on GitHub
SGEMM and DGEMM subroutines using AVX512F instructions.
☆15May 22, 2022Updated 4 years ago
marcsous / gpuSparse
View on GitHub
Matlab mex wrappers to cuSPARSE (NVIDIA)
☆11Dec 10, 2025Updated 7 months ago
UDC-GAC / venom
View on GitHub
A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores
☆62Nov 24, 2023Updated 2 years ago
atmughrabi / OpenGraph
View on GitHub
OpenGraph is an open-source graph processing benchmarking suite written in pure C/OpenMP.
☆14Apr 27, 2024Updated 2 years ago