kaletap / bfs-cuda-gpu

Implementation of a parallel breadth-first search (BFS) algorithm for graph traversal using CUDA and C++.

☆32

Alternatives and similar repositories for bfs-cuda-gpu

Users who are interested in bfs-cuda-gpu are comparing it to the repositories listed below.

rafalk342 / bfs-cuda
Implementation of breadth-first search on the GPU with the CUDA Driver API.
☆50 Updated 4 years ago
gunrock / loops
🎃 GPU load-balancing library for regular and irregular computations.
☆62 Updated last year
mabdullahsoyturk / HPC-Paper-Notes
My notes on various HPC papers.
☆22 Updated 2 years ago
owensgroup / ATOS
Multi-GPU dynamic scheduler using PGAS style cross-GPU communication
☆27 Updated last year
GPUPeople / spECK
Efficient SpGEMM on GPU using CUDA and CSR
☆56 Updated last year
codyjrivera / tsm2x-imp
Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA
☆32 Updated 4 years ago
XiaoSong9905 / HPC-Notes
Personal Notes for Learning HPC & Parallel Computation [Active Adding New Content]
☆67 Updated 2 years ago
poojahira / spmv-cuda
Implementation and analysis of five different GPU-based SpMV algorithms in CUDA
☆40 Updated 6 years ago
c3sr / comm_scope
NUMA-aware multi-CPU multi-GPU data transfer benchmarks
☆23 Updated last year
escalab / SIMD2
☆31 Updated 3 years ago
lixiuhong / batched_gemm
☆39 Updated 5 years ago
c3sr / tcu_scope
☆51 Updated 6 years ago
owensgroup / BGHT
BGHT: High-performance static GPU hash tables.
☆66 Updated 2 months ago
GPUPeople / ACSpGEMM
Repository holding the code base to AC-SpGEMM: "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"
☆28 Updated 4 years ago
wzsh / wmma_tensorcore_sample
Matrix Multiply-Accumulate with CUDA and WMMA (Tensor Core)
☆137 Updated 4 years ago
owensgroup / GpuBTree
Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019
☆56 Updated 3 years ago
spcl / open-earth-compiler
Development repository for the Open Earth Compiler
☆80 Updated 4 years ago
gunrock / essentials
❤️ CUDA/C++ GPU graph analytics simplified.
☆31 Updated 2 years ago
owensgroup / SlabHash
A warp-oriented dynamic hash table for GPUs
☆73 Updated last year
lixiuhong / implicit_gemm_convolution
☆15 Updated 6 years ago
oresths / tSparse
A GPU algorithm for sparse matrix-matrix multiplication
☆71 Updated 4 years ago
wmmae / wmma_extension
An extension library of WMMA API (Tensor Core API)
☆99 Updated 11 months ago
uuudown / Tartan
Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite
☆65 Updated 6 years ago
microsoft / ConvStencil
☆30 Updated last year
TiledTensor / TiledCUDA
We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel …
☆183 Updated 5 months ago
getianao / ngAP
ngAP's artifact for ASPLOS'24
☆23 Updated last week
ParCIS / Magicube
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
☆89 Updated 2 years ago
hgyhungry / ge-spmm
☆107 Updated 3 years ago
SuperScientificSoftwareLaboratory / TileSpGEMM
Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…
☆40 Updated last year
sleeepyjack / warpcore
A Library for fast Hash Tables on GPUs
☆122 Updated 2 years ago