microsoft/ConvStencil

☆37

Alternatives and similar repositories for ConvStencil

Users who are interested in ConvStencil are comparing it to the libraries listed below.

temporal-hpc / reduction-tensor-cores
Fast GPU based tensor core reductions
☆12 · Jan 13, 2023 · Updated 3 years ago
ParCIS / Magicube
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
☆92 · Nov 23, 2022 · Updated 3 years ago
interestingLSY / hpcgame.minesweeper
☆14 · Jan 18, 2023 · Updated 3 years ago
MoZeWei / moTuner
☆10 · May 12, 2022 · Updated 4 years ago
LucasWilkinson / ASpT-mirror
Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding
☆17 · Oct 20, 2021 · Updated 4 years ago
zhisbug / Cavs
Cavs: An Efficient Runtime System for Dynamic Neural Networks
☆15 · Sep 18, 2020 · Updated 5 years ago
marsupialtail / gpu-sparsert
☆18 · Oct 15, 2020 · Updated 5 years ago
apuaaChen / vectorSparse
☆32 · Aug 24, 2022 · Updated 3 years ago
hgyhungry / ge-spmm
☆115 · Jul 3, 2021 · Updated 5 years ago
weifengliu-ssslab / Benchmark_SpTRSM_using_CSC
Fast Synchronization-Free Algorithms for Parallel Sparse Triangular Solves with Multiple Right-Hand Sides (SpTRSM)
☆17 · Feb 14, 2020 · Updated 6 years ago
spcl / smat
Code for High Performance Unstructured SpMM Computation Using Tensor Cores
☆35 · Nov 3, 2024 · Updated last year
owensgroup / merge-spmm
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
☆74 · Oct 5, 2020 · Updated 5 years ago
vortexgpgpu / Volt
☆17 · Feb 9, 2026 · Updated 5 months ago
rox906 / tcFFT
☆43 · May 21, 2021 · Updated 5 years ago
weifengliu-ssslab / Benchmark_SpTRSV_using_CSC
A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV)
☆23 · Feb 14, 2020 · Updated 6 years ago
UofT-EcoSystem / Minuet
[EuroSys'24] Minuet: Accelerating 3D Sparse Convolutions on GPUs
☆80 · Jun 7, 2024 · Updated 2 years ago
YukeWang96 / TC-GNN_ATC23
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
☆58 · Oct 16, 2023 · Updated 2 years ago
amazon-science / FeatGraph
☆69 · Jun 16, 2021 · Updated 5 years ago
lemyx / tilelang-dsa
DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelang
☆47 · Nov 19, 2025 · Updated 8 months ago
redbird-arch / isca2025-chimera-artifact
Artifact of Chimera
☆18 · May 6, 2025 · Updated last year
AyakaGEMM / Hands-on-MLIR
☆17 · May 14, 2024 · Updated 2 years ago
xiezhq-hermann / graphiler
Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…
☆59 · Oct 3, 2022 · Updated 3 years ago
c3sr / tcu_scope
☆50 · Jun 27, 2019 · Updated 7 years ago
HPMLL / DTC-SpMM_ASPLOS24
☆47 · Jun 19, 2024 · Updated 2 years ago
parasailteam / coconet
☆85 · Dec 2, 2022 · Updated 3 years ago
cslab-ntua / SpMV-Research
☆24 · Jun 12, 2026 · Updated last month
ParCIS / FlashSparse
FlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swa…
☆39 · Oct 5, 2025 · Updated 9 months ago
EnigmaHuang / Saad_Book_ForTran
Some "Formula Translations" for Yousef Saad's book "Iterative Methods for Sparse Linear Systems (2nd Edition)"
☆13 · Jan 14, 2018 · Updated 8 years ago
nox-410 / tvm.tl
An extension of TVMScript to write simple and high performance GPU kernels with tensorcore.
☆52 · Jul 23, 2024 · Updated 2 years ago
microsoft / TrainVerify
A verification tool for ensuring parallelization equivalence in distributed model training.
☆17 · Sep 1, 2025 · Updated 10 months ago
CtopCsUtahEdu / bricklib
Distributed Performance-portable Stencil Computation
☆10 · Jul 9, 2023 · Updated 3 years ago
cherichy / tilecute
☆32 · Jul 2, 2025 · Updated last year
apuaaChen / EVT_AE
Artifacts of EVT ASPLOS'24
☆29 · Mar 6, 2024 · Updated 2 years ago
sfilippone / mld2p4-2
☆14 · Jul 16, 2020 · Updated 6 years ago
OP-DSL / clang-op-translator
Clang-based translator for OP2
☆12 · Jul 17, 2022 · Updated 4 years ago
NVIDIA / jax-tvm-ffi
JAX support for tvm-ffi abi
☆26 · May 14, 2026 · Updated 2 months ago
chemeng / GPGPU-GMRES-Method
CUDA GPU implementation of GMRES iterative Solver
☆10 · Apr 16, 2012 · Updated 14 years ago
aoli-al / HFuse
Horizontal Fusion
☆24 · Jan 7, 2022 · Updated 4 years ago
zlib-ng / pigzbench
Test of parallel compression acceleration
☆16 · Sep 8, 2024 · Updated last year