lumianph / gpuprec
gpuprec: Extended-Precision Libraries on GPUs
☆34Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for gpuprec
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆45Updated 9 years ago
- sparse matrix pre-processing library☆81Updated 6 months ago
- Kernel Tuning Toolkit☆55Updated 3 weeks ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆99Updated 7 years ago
- Use CUDA intrinsics with user-defined types☆47Updated 10 years ago
- Next generation library for iterative sparse solvers for ROCm platform☆76Updated this week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆102Updated last year
- a software library containing Sparse functions written in OpenCL☆173Updated 4 years ago
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆104Updated 3 months ago
- mallocMC: Memory Allocator for Many Core Architectures☆51Updated last week
- a tester for BLAS libraries including OpenBLAS and Intel MKL. This project is based on ATLAS BLAS Tester☆33Updated last year
- Counter-based random number generators for C, C++ and CUDA.☆89Updated 9 months ago
- Launching collective tasks in bulk☆36Updated 5 years ago
- Full-speed Array of Structures access☆162Updated last year
- Sample programs for the LLVM PTX back-end☆34Updated 9 years ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆196Updated 2 weeks ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆65Updated last year
- Implementation of the Remez algorithm☆13Updated last year
- A library to benchmark CUDA code, similar to google benchmark.☆28Updated 3 years ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆48Updated 3 months ago
- Compute applications.☆25Updated 4 years ago
- ☆75Updated last year
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 2 months ago
- makefile.include☆20Updated 3 years ago
- The SparseX sparse kernel optimization library☆39Updated 5 years ago
- ulmBLAS☆104Updated 2 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆43Updated 10 months ago
- Examples for HIP☆200Updated 2 weeks ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 4 years ago
- C++11 Message Passing☆74Updated 2 years ago