hpcgarage/cuASR

cuASR: CUDA Algebra for Semirings

☆49

Alternatives and similar repositories for cuASR

Users who are interested in cuASR are comparing it to the libraries listed below.

metagraph-dev / mlir-graphblas
MLIR tools and dialect for GraphBLAS
☆18 · Mar 30, 2022 · Updated 4 years ago
AlgebraicJulia / StructuredDecompositions.jl
Structured decompositions!
☆15 · Mar 26, 2025 · Updated last year
gty111 / SimpleUseGpgpuSim
GPGPU-SIM usage guide
☆14 · Nov 12, 2022 · Updated 3 years ago
getianao / ngAP
ngAP's artifact for ASPLOS'24
☆25 · Jul 29, 2025 · Updated 11 months ago
escalab / SIMD2
☆31 · Jun 15, 2022 · Updated 4 years ago
TensorBFS / TropicalNumbers.jl
Tropical Numbers
☆17 · Nov 2, 2025 · Updated 8 months ago
GindaChen / FlexFlashAttention3
FlexAttention w/ FlashAttention3 Support
☆27 · Oct 5, 2024 · Updated last year
twalgor / tw
☆14 · Feb 15, 2022 · Updated 4 years ago
NGIOproject / PMTutorial
Slides and exercises for persistent memory programming tutorial
☆14 · Nov 14, 2022 · Updated 3 years ago
txstc55 / EGGS
EGGS, a method to speed up sparse matrix operations when the same sparsity is used for multiple times. This repo contains examples that s…
☆26 · Aug 4, 2020 · Updated 5 years ago
arcsysu / SYSU-ARCH
SYSU-ARCH is a LAB that focuses on the use and extending of simulators.
☆10 · Dec 19, 2022 · Updated 3 years ago
gunrock / loops
🎃 GPU load-balancing library for regular and irregular computations.
☆67 · Jun 25, 2026 · Updated 3 weeks ago
LighthouseHPC / lighthouse
☆11 · Apr 10, 2019 · Updated 7 years ago
xuanzhaogao / TreeWidthSolver.jl
Implementation of the tree width algorithms.
☆19 · Nov 24, 2025 · Updated 7 months ago
aoli-al / HFuse
Horizontal Fusion
☆24 · Jan 7, 2022 · Updated 4 years ago
chhzh123 / Krill
An efficient concurrent graph processing system
☆46 · Oct 27, 2021 · Updated 4 years ago
illinois-impact / klap
A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches
☆15 · Jun 21, 2019 · Updated 7 years ago
ROCm / roc-stdpar
☆20 · Jan 17, 2024 · Updated 2 years ago
SparseLinearAlgebra / cuBool
Sparse linear Boolean algebra for Nvidia Cuda
☆27 · Nov 17, 2025 · Updated 8 months ago
GiggleLiu / YaoTutorial
A tutorial for Yao.jl
☆11 · Oct 9, 2023 · Updated 2 years ago
Runjing-Liu120 / RaoBlackwellizedSGD
A public repository for our paper, Rao-Blackwellized Stochastic Gradients for Discrete Distributions
☆22 · May 5, 2019 · Updated 7 years ago
bergen / EdgeTransformer
☆22 · Dec 1, 2021 · Updated 4 years ago
bdusell / stack-attention
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18 · Mar 15, 2024 · Updated 2 years ago
emalach / LinearLM
Code for the paper: https://arxiv.org/pdf/2309.06979.pdf
☆21 · Jul 29, 2024 · Updated last year
gcoe-dresden / cuda-gpu-tlb
TLB Benchmarks
☆35 · Sep 11, 2017 · Updated 8 years ago
HPAC / linnea
Linnea is an experimental tool for the automatic generation of optimized code for linear algebra problems.
☆70 · Aug 24, 2025 · Updated 10 months ago
Doraemonzzz / nanoTransNormer
☆11 · Oct 11, 2023 · Updated 2 years ago
da03 / criticize_text_generation
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆12 · Mar 18, 2023 · Updated 3 years ago
NVlabs / ptxmemorymodel
☆77 · May 29, 2019 · Updated 7 years ago
TensorBFS / Gaussian-fPEPS
Code for the paper "Projected d-wave superconducting state: a fermionic projected entangled pair state study"
☆16 · Aug 6, 2025 · Updated 11 months ago
jenni-ai / T2FW
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20 · Oct 9, 2022 · Updated 3 years ago
clouren / SPEX
☆15 · Apr 8, 2026 · Updated 3 months ago
mcoavoux / mtg
Statistical discontinuous constituent parsing
☆11 · Feb 15, 2018 · Updated 8 years ago
MegEngine / cutlass-bak
modified cutlass
☆16 · Oct 26, 2020 · Updated 5 years ago
yarrow-id / diagrams
string diagrams for the working programmer
☆15 · Jul 17, 2023 · Updated 3 years ago
eth-cscs / Tiled-MM
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.
☆33 · Apr 2, 2025 · Updated last year
PASSIONLab / distributed_sddmm
Distributed SDDMM Kernel
☆12 · Jul 8, 2022 · Updated 4 years ago
uuudown / SBNN
Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)
☆17 · Dec 9, 2020 · Updated 5 years ago
robert-lieck / RBN
Recursive Bayesian Networks
☆11 · May 11, 2025 · Updated last year