feifeibear/swGEMM

A highly efficient library for GEMM operations on Sunway TaihuLight

☆18

Alternatives and similar repositories for swGEMM

Users that are interested in swGEMM are comparing it to the libraries listed below.

feifeibear / swDNN
A highly efficient library for deep neural networks based on the Sunway TaihuLight supercomputer.
☆17 · Sep 3, 2018 · Updated 7 years ago
feifeibear / SWCaffe
A Deep Learning Framework customized for Sunway TaihuLight
☆42 · Jan 8, 2019 · Updated 7 years ago
weifengliu-ssslab / Benchmark_SpTRSV_using_CSC
A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV)
☆23 · Feb 14, 2020 · Updated 6 years ago
haswelliris / CPC2018-GROMACS
CPC2018: finals of the 2nd Domestic CPU Parallel Application Challenge
☆11 · Oct 26, 2018 · Updated 7 years ago
rox906 / tcFFT
☆43 · May 21, 2021 · Updated 5 years ago
sfilippone / mld2p4-2
☆14 · Jul 16, 2020 · Updated 6 years ago
chemeng / GPGPU-GMRES-Method
CUDA GPU implementation of GMRES iterative Solver
☆10 · Apr 16, 2012 · Updated 14 years ago
zeroine / cutlass-cute-sample
☆49 · Apr 15, 2024 · Updated 2 years ago
SymbioticLab / Hydra
Hydra adds resilience and high availability to remote memory solutions.
☆33 · Feb 22, 2022 · Updated 4 years ago
gevtushenko / block_matrix_format_performance
☆12 · Jan 19, 2020 · Updated 6 years ago
SuperScientificSoftwareLaboratory / TileSpGEMM
Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…
☆48 · May 22, 2024 · Updated 2 years ago
spcl / arrow-matrix
Arrow Matrix Decomposition - Communication-Efficient Distributed Sparse Matrix Multiplication
☆15 · Mar 25, 2024 · Updated 2 years ago
visiblehawk / foam-extend-4.1
☆13 · Jan 18, 2020 · Updated 6 years ago
ecrc / polar
Distributed-memory, double-precision, polar decomposition (QDWH/ZOLO-PD) of a dense matrix, svd (QDWH/ZOLOPD-SVD) of a dense matrix
☆14 · Jun 3, 2020 · Updated 6 years ago
zhen-xie / IA-SpGEMM
IA-SPGEMM
☆44 · Oct 19, 2024 · Updated last year
jyi2ya / tc-lab
Let's count triangles together!
☆10 · Jun 27, 2024 · Updated 2 years ago
feifeibear / PyTorchMemTracer
Depict GPU memory footprint during DNN training of PyTorch
☆11 · Nov 17, 2022 · Updated 3 years ago
PaulSolt / GLUT-Object-Oriented-Framework
A bare bones animation framework built around GLUT to ease graphics development projects.
☆20 · Dec 6, 2017 · Updated 8 years ago
arirepo / paraGMRES
Massively Scalable Parallel GMRES C-code for Sparse System of Equations
☆13 · Feb 16, 2016 · Updated 10 years ago
hpcaitech / GPT-Demo
GPT Demo with hybrid distributed training
☆10 · Dec 1, 2022 · Updated 3 years ago
amlatyrngom / SQLIR
SQL Optimizations using MLIR
☆12 · Apr 5, 2020 · Updated 6 years ago
temporal-hpc / reduction-tensor-cores
Fast GPU based tensor core reductions
☆12 · Jan 13, 2023 · Updated 3 years ago
rocking5566 / SSD-MobileNet-ncnn
SSD-MobileNet with Tencent ncnn framework
☆11 · Feb 13, 2018 · Updated 8 years ago
ChenhanYu / hmlp
High-Performance Machine Learning Primitives
☆13 · Apr 17, 2021 · Updated 5 years ago
karlrupp / spgemm-mkl-benchmark
Sparse Matrix-Matrix Multiplication Benchmark on Intel Xeon and Xeon Phi (KNC, KNL) from blog post:
☆12 · Sep 25, 2016 · Updated 9 years ago
gowerrobert / StochOpt.jl
A suite of stochastic optimization methods for solving the empirical risk minimization problem.
☆17 · Nov 20, 2019 · Updated 6 years ago
YusukeNagasaka / Batched-SpMM
New batched algorithm for sparse matrix-matrix multiplication (SpMM)
☆16 · May 7, 2019 · Updated 7 years ago
cyanguwa / nersc-roofline
☆52 · Sep 5, 2020 · Updated 5 years ago
carekit-apple / IBM-HyperProtectSDK
The IBM Hyper Protect iOS SDK for CareKit is an addon for the CareKit framework that consumes IBM Hyper Protect Services for zero-trust p…
☆12 · Sep 2, 2020 · Updated 5 years ago
TurboNLP / Translate-Demo
A Translation Task using TurboTransformers
☆10 · Dec 17, 2020 · Updated 5 years ago
Olament / Hanzi2PinyinEngine
Hanzi to Pinyin engine in Swift (Pinyin input method engine)
☆13 · Mar 29, 2024 · Updated 2 years ago
billmuch / matmul_perf_test
☆15 · Apr 15, 2022 · Updated 4 years ago
jpakkane / jpak
Jpak compression format
☆15 · Mar 12, 2017 · Updated 9 years ago
Forwil / tvmt_v2
☆10 · Aug 4, 2020 · Updated 5 years ago
ISRC-CAS / PLCT-OpenDay-2020
Presentation materials from the PLCT Lab 2020 Open Day
☆13 · Dec 29, 2020 · Updated 5 years ago
dbindel / cs6210-f16
Course repository for Cornell CS 6210, Fall 2016
☆18 · Nov 30, 2016 · Updated 9 years ago
enarx-archive / sevctl
Administrative utility for AMD SEV
☆15 · Apr 13, 2022 · Updated 4 years ago
BG2BKK / my_benchmark
Benchmark for Linux servers
☆13 · Nov 6, 2016 · Updated 9 years ago
zhuzilin / pytorch-malloc
An external memory allocator example for PyTorch.
☆16 · Aug 10, 2025 · Updated 11 months ago