karthikeyann / cuda-calculator
HTML/JS port of CUDA Occupancy Calculator
☆17Updated 3 years ago
Alternatives and similar repositories for cuda-calculator:
Users that are interested in cuda-calculator are comparing it to the libraries listed below
- sparse matrix pre-processing library☆81Updated 8 months ago
- Online CUDA Occupancy Calculator☆73Updated 3 years ago
- cuASR: CUDA Algebra for Semirings☆35Updated 2 years ago
- A GPU performance prediction toolkit for CUDA programs☆16Updated 5 years ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆46Updated 9 years ago
- CUDA Dynamic Memory Allocator for SOA Data Layout☆34Updated 3 years ago
- Chunky Loop Interaction☆23Updated 5 years ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆76Updated 2 months ago
- A unified framework across multiple programming platforms☆33Updated 7 months ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- Full-speed Array of Structures access☆164Updated last year
- Python wrapper for isl, an integer set library☆74Updated last month
- Experimental Linear Algebra Performance Studies☆12Updated 7 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆67Updated last year
- development repository for the open earth compiler☆79Updated 3 years ago
- A library to benchmark CUDA code, similar to google benchmark.☆28Updated 3 years ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple…☆36Updated 3 years ago
- A thin wrapper around miOpen and cuDNN☆40Updated last year
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.☆7Updated last month
- The SparseX sparse kernel optimization library☆39Updated 6 years ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels☆18Updated 9 years ago
- Chai☆42Updated last year
- Library to plot integer sets and maps☆48Updated 8 years ago
- Next generation library for iterative sparse solvers for ROCm platform☆79Updated this week
- Stencil Probe - a stencil microbenchmark☆30Updated 12 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- Interoperability examples for OpenACC.☆49Updated 4 years ago
- MPI wrapper generator, for writing PMPI tool libraries☆34Updated 2 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆109Updated 8 months ago