uysalere/cuda-matrix-vector-multiplication

Matrix-Vector Multiplication Using Shared and Coalesced Memory Access

☆16

Alternatives and similar repositories for cuda-matrix-vector-multiplication

Users who are interested in cuda-matrix-vector-multiplication are comparing it to the repositories listed below.

weifengliu-ssslab / Benchmark_SpTRSV_using_CSC
A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV)
☆23 · Feb 14, 2020 · Updated 6 years ago
EnigmaHuang / Saad_Book_ForTran
Some "Formula Translations" for Yousef Saad's book "Iterative Methods for Sparse Linear Systems (2nd Edition)"
☆13 · Jan 14, 2018 · Updated 8 years ago
sfilippone / mld2p4-2
☆14 · Jul 16, 2020 · Updated 6 years ago
chemeng / GPGPU-GMRES-Method
CUDA GPU implementation of GMRES iterative Solver
☆10 · Apr 16, 2012 · Updated 14 years ago
accelazh / idea-collection
☆10 · Feb 8, 2015 · Updated 11 years ago
Siddhant-Ray / siddhant-ray.github.io
A beautiful, simple, clean and responsive Jekyll theme for my academic portfolio.
☆13 · Updated this week
gevtushenko / block_matrix_format_performance
☆12 · Jan 19, 2020 · Updated 6 years ago
kurenaif / auto_wmake
OpenFOAM right wmake at the right time
☆11 · Mar 10, 2019 · Updated 7 years ago
visiblehawk / foam-extend-4.1
☆13 · Jan 18, 2020 · Updated 6 years ago
gouarin / GenEO
☆10 · Jan 13, 2023 · Updated 3 years ago
magicse / ncnn-hifi-GAN
ncnn HiFi-GAN
☆30 · Sep 29, 2024 · Updated last year
kberkay / Cuda-Matrix-Multiplication
Matrix Multiplication on GPU using Shared Memory considering Coalescing and Bank Conflicts
☆26 · Aug 29, 2022 · Updated 3 years ago
HenkPoley / split-io-scheduler
Back/forward ported patches based on https://research.cs.wisc.edu/adsl/Software/split/
☆13 · Apr 30, 2016 · Updated 10 years ago
arirepo / paraGMRES
Massively Scalable Parallel GMRES C-code for Sparse System of Equations
☆13 · Feb 16, 2016 · Updated 10 years ago
hyln9 / GCNGEMM
Optimized half precision gemm assembly kernels (deprecated due to ROCm)
☆47 · Jun 16, 2017 · Updated 9 years ago
llnl / llnl-hires-timers
C library containing high resolution timer implementation for several platforms.
☆10 · Oct 20, 2020 · Updated 5 years ago
JeffyCN / yocto-manifests
Manifests for my rockchip yocto repo
☆12 · Jun 10, 2026 · Updated last month
temporal-hpc / reduction-tensor-cores
Fast GPU based tensor core reductions
☆12 · Jan 13, 2023 · Updated 3 years ago
BG2BKK / my_benchmark
benchmark for linux server
☆13 · Nov 6, 2016 · Updated 9 years ago
BorisPis / nicmem-asplos22-artifact
☆18 · Dec 11, 2023 · Updated 2 years ago
f32c / fpgarduino
FPGArduino binary
☆13 · Aug 5, 2019 · Updated 6 years ago
YusukeNagasaka / Batched-SpMM
New batched algorithm for sparse matrix-matrix multiplication (SpMM)
☆16 · May 7, 2019 · Updated 7 years ago
clach04 / bitbucket_tools
random tools for dealing with bitbucket.org repos
☆12 · Jan 8, 2022 · Updated 4 years ago
lionleaf / parallel-c-programs
A wide array of parallel programs using CUDA, OpenCL, MPI, OpenMP and pthreads.
☆14 · Jan 6, 2015 · Updated 11 years ago
zhen-xie / IA-SpGEMM
IA-SPGEMM
☆44 · Oct 19, 2024 · Updated last year
vasigavr1 / Odyssey
☆15 · May 13, 2022 · Updated 4 years ago
cubieboard / linux-sdk-kernel-source
linux-sdk-kernel-source for a10&a20
☆11 · Feb 24, 2018 · Updated 8 years ago
AnonymousYWL / MYLIB
☆18 · Apr 8, 2022 · Updated 4 years ago
leonf08 / SMBUS_PMBUS-Stack-STM32F407
Porting SMBUS/PMBUS Stack Middleware for STM32F407 MCU
☆12 · Jul 5, 2018 · Updated 8 years ago
radxa-pkg / rockchip-iqfiles
Additional camera tuning profiles for Rockchip SoC
☆14 · Jul 22, 2026 · Updated last week
hellox-project / HelloX_STM32
HelloX operating system for STM32 chipset
☆14 · Jan 18, 2015 · Updated 11 years ago
rosinality / melgan-pytorch
MelGAN and Tacotron 2 in PyTorch
☆11 · Oct 22, 2019 · Updated 6 years ago
TommyWu-fdgkhdkgh / spike-vp
spike-vp
☆13 · Feb 5, 2024 · Updated 2 years ago
Joshua-Riek / linux
Linux kernel source tree
☆10 · Aug 7, 2024 · Updated last year
1847123212 / RK3588_hdk
RK3588_hdk quad A76 & quad A53
☆15 · Apr 12, 2022 · Updated 4 years ago
MarkusSprunck / aparapi-gpu-band-matrix-solver
Lessons Learned from GPU Experiments with Aparapi
☆13 · Apr 17, 2016 · Updated 10 years ago
eccarson / ca-ksms
Matlab implementations of communication-avoiding Krylov subspace methods
☆12 · Sep 2, 2021 · Updated 4 years ago
brian-kelley / CUDA-QR
A new QR decomposition algorithm implemented in CUDA
☆18 · Jun 24, 2024 · Updated 2 years ago
MartDevelopers-Inc / KEA-Hotel-ERP
Opensource Light Weight Hotel Enterprise Resource Planning System
☆14 · Feb 5, 2021 · Updated 5 years ago