ecrc/kblas-gpu

ecrc / kblas-gpu

Subset of BLAS routines optimized for NVIDIA GPUs

☆80

Alternatives and similar repositories for kblas-gpu

Users who are interested in kblas-gpu are comparing it to the libraries listed below.

eth-cscs / spla
Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…
☆32 • Jun 26, 2024 • Updated 2 years ago
zhangyf-neu / maiter
Automatically exported from code.google.com/p/maiter
☆14 • Jan 22, 2021 • Updated 5 years ago
shamanDevel / cuMat
An expression template based linear algebra library running completely on the GPU using CUDA
☆26 • Jun 24, 2021 • Updated 5 years ago
libocca / occa.py
OCCA Python API: JIT Compilation for Multiple Architectures
☆11 • Dec 20, 2019 • Updated 6 years ago
ecrc / polar
Distributed-memory, double-precision, polar decomposition (QDWH/ZOLO-PD) of a dense matrix, svd (QDWH/ZOLOPD-SVD) of a dense matrix
☆14 • Jun 3, 2020 • Updated 6 years ago
md2z34 / winograd_gpu
GPU implementation of Winograd convolution
☆10 • Oct 23, 2017 • Updated 8 years ago
olcf / NVIDIA-tensor-core-examples
☆20 • Nov 7, 2019 • Updated 6 years ago
llnl / uberenv
Automates using spack to build and deploy software
☆30 • Updated this week
LuxGraph / Lux
A Distributed Multi-GPU System for Fast Graph Processing
☆65 • Oct 25, 2018 • Updated 7 years ago
llnl / H5Z-ZFP
A registered ZFP compression plugin for HDF5
☆55 • Jul 15, 2026 • Updated last week
ChASE-library / ChASE
This repository mirrors the principal Gitlab repository of the Chebyshev Accelerated Subspace iteration Eigensolver. If you want to contr…
☆20 • Jul 8, 2026 • Updated 2 weeks ago
netsyslab / Totem
A graph processing engine for hybrid CPU and GPU platforms
☆40 • Feb 7, 2019 • Updated 7 years ago
pkestene / ppkMHD
MPI+Kokkos implementation of spectral difference method (SDM) high order schemes
☆30 • Feb 2, 2025 • Updated last year
chenxuhao / caffe-escoin
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs
☆16 • Feb 28, 2019 • Updated 7 years ago
SparseBLAS / spblas-reference
☆41 • Jun 29, 2026 • Updated 3 weeks ago
pkestene / MS-HPC-AI-GPU
Resources for the introductory GPU programming course of the HPC-AI specialized master's program
☆23 • Jan 11, 2024 • Updated 2 years ago
MatanHamilis / one_stencil
Multiple 1-stencil implementations using nvidia cuda.
☆12 • Dec 2, 2017 • Updated 8 years ago
wdmapp / gtensor
GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.
☆37 • Mar 5, 2026 • Updated 4 months ago
GPUEngineering / RapidNet
GPU-powered stochastic MPC for drinking water networks
☆16 • Sep 12, 2022 • Updated 3 years ago
pmodels / yaksa
Yaksa: High-performance Noncontiguous Data Management
☆17 • Oct 1, 2025 • Updated 9 months ago
LighthouseHPC / lighthouse
☆11 • Apr 10, 2019 • Updated 7 years ago
sandialabs / LAPIS
An MLIR-based compiler targeting Kokkos and other programming models
☆17 • Jul 14, 2026 • Updated last week
pmodels / bolt
Official BOLT Repository
☆33 • Aug 16, 2024 • Updated last year
ap-hynninen / cutt
CUDA Tensor Transpose (cuTT) library
☆55 • Aug 10, 2017 • Updated 8 years ago
lssfau / hyteg
HyTeG (Hybrid Tetrahedral Grids) is a C++ framework for large scale high performance finite element simulations based on (but not limited…
☆19 • Jul 2, 2026 • Updated 2 weeks ago
ginkgo-project / ginkgo
Numerical linear algebra software package
☆611 • Updated this week
patflick / miopen-benchmark
benchmarking miopen
☆17 • Jan 14, 2019 • Updated 7 years ago
hclhkbu / gcoospdm
Sparse-dense matrix-matrix multiplication on GPUs
☆14 • Oct 15, 2018 • Updated 7 years ago
sandialabs / lgrtk
Tool Kit for Lagrangian Grid Reconnection
☆24 • May 26, 2023 • Updated 3 years ago
NVIDIA / NVPLSamples
NVIDIA Performance Libraries: Sample code
☆23 • May 28, 2026 • Updated last month
cp2k / dbcsr
DBCSR: Distributed Block Compressed Sparse Row matrix library
☆155 • Updated this week
hornet-gt / hornet
Hornet data structure for sparse dynamic graphs and matrices
☆90 • Nov 14, 2019 • Updated 6 years ago
correaa / boost-multi
Multidimensional arrays for C++. (Not an official Boost library.) This is a mirror of gitlab.com/correaa/boost-multi
☆20 • Updated this week
UoB-HPC / BabelStream
STREAM, for lots of devices written in many programming models
☆370 • Jun 15, 2026 • Updated last month
UoB-HPC / minifmm
☆11 • Aug 8, 2021 • Updated 4 years ago
ecrc / hicma
HiCMA: Hierarchical Computations on Manycore Architectures
☆37 • Mar 19, 2023 • Updated 3 years ago
ORNL-CEES / mfmg
MFMG is an open-source library implementing matrix-free multigrid methods.
☆18 • Oct 4, 2019 • Updated 6 years ago
gunrock / loops
🎃 GPU load-balancing library for regular and irregular computations.
☆67 • Jun 25, 2026 • Updated 3 weeks ago
llnl / irep
A tool for filling C/C++ or Fortran data structures from Lua input tables
☆15 • Apr 7, 2026 • Updated 3 months ago