YusukeNagasaka/Batched-SpMM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YusukeNagasaka/Batched-SpMM)

YusukeNagasaka / Batched-SpMM

New batched algorithm for sparse matrix-matrix multiplication (SpMM)

☆16

Alternatives and similar repositories for Batched-SpMM

Users that are interested in Batched-SpMM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hgyhungry / ge-spmm
View on GitHub
☆115Jul 3, 2021Updated 5 years ago
apuaaChen / vectorSparse
View on GitHub
☆32Aug 24, 2022Updated 3 years ago
GPUPeople / spECK
View on GitHub
Efficient SpGEMM on GPU using CUDA and CSR
☆61Jul 18, 2023Updated 3 years ago
weifengliu-ssslab / bhSPARSE
View on GitHub
bhSPARSE: A Sparse BLAS Library
☆17Nov 6, 2015Updated 10 years ago
HipGraph / FusedMM
View on GitHub
Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural N…
☆31Aug 12, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
chenxuhao / caffe-escoin
View on GitHub
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs
☆16Feb 28, 2019Updated 7 years ago
Guangxuan-Xiao / SPMM-CUDA
View on GitHub
☆13Jun 23, 2022Updated 4 years ago
temporal-hpc / reduction-tensor-cores
View on GitHub
Fast GPU based tensor core reductions
☆12Jan 13, 2023Updated 3 years ago
YukeWang96 / TC-GNN_ATC23
View on GitHub
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
☆58Oct 16, 2023Updated 2 years ago
pigirons / spmv
View on GitHub
This is a tuned sparse matrix dense vector multiplication(SpMV) library
☆23Mar 21, 2016Updated 10 years ago
ParCIS / Magicube
View on GitHub
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
☆92Nov 23, 2022Updated 3 years ago
weifengliu-ssslab / Benchmark_SpGEMM_using_CSR
View on GitHub
CSR-based SpGEMM on nVidia and AMD GPUs
☆48Apr 9, 2016Updated 10 years ago
maltanar / spmv-vector-cache
View on GitHub
A Vector Caching Scheme for Streaming FPGA SpMV Accelerators
☆10Sep 7, 2015Updated 10 years ago
LucasWilkinson / ASpT-mirror
View on GitHub
Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding
☆17Oct 20, 2021Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
CMU-SAFARI / MemSchedSim
View on GitHub
This simulator models multi core systems, intended primarily for studies on main memory management techniques. It models a trace-based ou…
☆12Jan 18, 2016Updated 10 years ago
xh5a5n6k6 / image-stitching
View on GitHub
Produce panoramic image from multiple photographs with overlapping fields of view written in C++17.
☆10Feb 19, 2020Updated 6 years ago
pchan1401-ICIL / Camera2FOV
View on GitHub
camera2 calculate fov angle
☆10Feb 1, 2017Updated 9 years ago
oresths / tSparse
View on GitHub
A GPU algorithm for sparse matrix-matrix multiplication
☆74Oct 1, 2020Updated 5 years ago
vtsynergy / bb_segsort
View on GitHub
☆21Aug 21, 2023Updated 2 years ago
CMU-SAFARI / ASMSim
View on GitHub
This simulator models multi core systems with primary focus on the memory hierarchy. It models a trace-based out-of-order core frontend a…
☆12Feb 12, 2016Updated 10 years ago
GATECH-EIC / LLM4HWDesign_Starting_Toolkit
View on GitHub
LLM4HWDesign Starting Toolkit
☆19Oct 4, 2024Updated last year
codyjrivera / tsm2x-imp
View on GitHub
Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA
☆35Jul 28, 2020Updated 5 years ago
weifengliu-ssslab / Benchmark_SpMV_using_CSR5
View on GitHub
CSR5-based SpMV on CPUs, GPUs and Xeon Phi
☆111Jun 10, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lenLRX / AmpereSparseMatmul
View on GitHub
study of Ampere' Sparse Matmul
☆18Jan 10, 2021Updated 5 years ago
ceruleangu / Block-Sparse-Benchmark
View on GitHub
Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.
☆23Aug 21, 2020Updated 5 years ago
YulhwaKim / cutlass_tilesparse
View on GitHub
CUDA templates for tile-sparse matrix multiplication based on CUTLASS.
☆52Mar 1, 2018Updated 8 years ago
c3sr / tcu_scope
View on GitHub
☆50Jun 27, 2019Updated 7 years ago
microsoft / ConvStencil
View on GitHub
☆37Apr 10, 2024Updated 2 years ago
weifengliu-ssslab / Benchmark_SpMV_using_CSR
View on GitHub
CSR-based SpMV on Heterogeneous Processors (Intel Broadwell, AMD Kaveri and nVidia Tegra K1)
☆26May 12, 2015Updated 11 years ago
junxian-li-hpc / bilibili-favs-manage
View on GitHub
☆10Jun 27, 2026Updated 3 weeks ago
PAA-NCIC / GSWITCH
View on GitHub
A pattern-based algorithmic autotuner for graph processing on GPUs.
☆33Jun 25, 2025Updated last year
EBD-CREST / nsparse
View on GitHub
Sparse matrix computation library for GPU
☆59Jul 12, 2020Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
sfilippone / mld2p4-2
View on GitHub
☆14Jul 16, 2020Updated 6 years ago
chemeng / GPGPU-GMRES-Method
View on GitHub
CUDA GPU implementation of GMRES iterative Solver
☆10Apr 16, 2012Updated 14 years ago
p404 / jaeger-elasticsearch-compose
View on GitHub
Docker-compose configuration for quick deployment of jaeger using as a Elasticsearch as a storage
☆14Jun 4, 2018Updated 8 years ago
libingbingdev / Facial_Keypoints
View on GitHub
人脸关键点检测--68 keypoints
☆10Nov 22, 2022Updated 3 years ago
robjsliwa / llama-agent
View on GitHub
Fun project to run your own LLM chat bot using llama.cpp
☆11Jun 9, 2023Updated 3 years ago
atmughrabi / OpenGraph
View on GitHub
OpenGraph is an open-source graph processing benchmarking suite written in pure C/OpenMP.
☆14Apr 27, 2024Updated 2 years ago
PASSIONLab / distributed_sddmm
View on GitHub
Distributed SDDMM Kernel
☆12Jul 8, 2022Updated 4 years ago