HipGraph / FusedMMLinks

Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural Networks"

☆31

Alternatives and similar repositories for FusedMM

Users that are interested in FusedMM are comparing it to the libraries listed below

Sorting:

hgyhungry / ge-spmm
☆109Updated 4 years ago
owensgroup / merge-spmm
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
☆72Updated 4 years ago
xxcclong / GNN-Computing
Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations"
☆39Updated 3 years ago
xiezhq-hermann / graphiler
Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…
☆59Updated 2 years ago
amazon-science / FeatGraph
☆70Updated 4 years ago
YukeWang96 / TC-GNN_ATC23
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
☆49Updated last year
YukeWang96 / GNNAdvisor_OSDI21
Artifact for OSDI'21 GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs.
☆66Updated 2 years ago
CMUAbstract / Graph-Reordering-IISWC18
Repo for the IISWC 2018 submission
☆9Updated 3 years ago
PASSIONLab / CAGNET
☆47Updated 3 months ago
zjjzby / GNN-hardware-acceleration-paper
This repo is to collect the state-of-the-art GNN hardware acceleration paper
☆54Updated 4 years ago
YusukeNagasaka / Batched-SpMM
New batched algorithm for sparse matrix-matrix multiplication (SpMM)
☆16Updated 6 years ago
jiazhihao / ROC
Distributed Multi-GPU GNN Framework
☆36Updated 5 years ago
dglai / FeatGraph
Sparse kernels for GNNs based on TVM
☆17Updated 4 years ago
YukeWang96 / MGG_OSDI23
Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…
☆40Updated last year
HPCRL / ASPLOS_artifact
☆13Updated 3 years ago
CMU-SAFARI / SparseP
SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) ar…
☆75Updated 3 years ago
ceruleangu / Block-Sparse-Benchmark
Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.
☆24Updated 4 years ago
araij / rabbit_order
☆46Updated 2 years ago
dgSPARSE / dgNN
[Mlsys'22] Understanding gnn computational graph: A coordinated computation, io, and memory perspective
☆20Updated last year
YukeWang96 / QGTC_PPoPP22
Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.
☆30Updated 3 years ago
PAA-NCIC / GSWITCH
A pattern-based algorithmic autotuner for graph processing on GPUs.
☆31Updated last month
ParCIS / Magicube
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
☆89Updated 2 years ago
GATECH-EIC / GCoD
[HPCA 2022] GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆36Updated 3 years ago
concept-inversion / C-SAW
A Framework for Graph Sampling and Random Walk on GPUs.
☆38Updated 6 months ago
GATECH-EIC / BNS-GCN
[MLSys 2022] "BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node …
☆56Updated last year
escalab / SIMD2
☆31Updated 3 years ago
cornell-zhang / GraphLily
A graph linear algebra overlay
☆51Updated 2 years ago
rishucoding / reproduce_MICRO24_GPU_DLRM_inference
Sharing the codebase and steps for artifact evaluation/reproduction for MICRO 2024 paper
☆9Updated 11 months ago
Xtra-Computing / G3
G3: A Programmable GNN Training System on GPU
☆43Updated 4 years ago
cornell-zhang / UniSparse
Code base for OOPSLA'24 paper: UniSparse: An Intermediate Language for General Sparse Format Customization
☆30Updated 8 months ago