lumianph / gpuprec
gpuprec: Extended-Precision Libraries on GPUs
☆35Updated 9 years ago
Alternatives and similar repositories for gpuprec:
Users that are interested in gpuprec are comparing it to the libraries listed below
- sparse matrix pre-processing library☆81Updated 8 months ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆46Updated 9 years ago
- Compute applications.☆24Updated 5 years ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆104Updated last year
- a software library containing Sparse functions written in OpenCL☆173Updated 4 years ago
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆49Updated 4 months ago
- Full-speed Array of Structures access☆164Updated last year
- A library for C++/Fortran computer simulations (e.g. stencil codes, mesh-free, unstructured grids, n-body & particle methods). Scales fro…☆40Updated 3 years ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 2 years ago
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆106Updated 5 months ago
- mallocMC: Memory Allocator for Many Core Architectures☆53Updated this week
- Next generation library for iterative sparse solvers for ROCm platform☆79Updated this week
- Par4All is an automatic parallelizing and optimizing compiler (workbench) for C and Fortran sequential programs☆52Updated 9 years ago
- Counter-based random number generators for C, C++ and CUDA.☆91Updated 11 months ago
- Implementation of AMD HIP for CPUs☆22Updated 4 years ago
- Mandelbrot fractal on NVidia GPUs using CUDA dynamic parallelism and Mariani-Silver algorithm☆28Updated 10 years ago
- A mini-app to represent the multipole resonance representation lookup cross section algorithm.☆22Updated last year
- a tester for BLAS libraries including OpenBLAS and Intel MKL. This project is based on ATLAS BLAS Tester☆34Updated last year
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels☆18Updated 9 years ago
- Next generation FFT implementation for ROCm☆184Updated this week
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures☆49Updated last year
- Automatically exported from code.google.com/p/patus☆15Updated 9 years ago
- HIP back-end for Thrust that has been replaced by rocThrust☆28Updated last year
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.☆7Updated 3 weeks ago
- High-level C++ for Accelerator Clusters☆143Updated this week
- GPU implementation of classical molecular dynamics proxy application.☆31Updated 7 years ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 3 months ago
- Use CUDA intrinsics with user-defined types☆47Updated 10 years ago
- Data parallel C++ mathematical object library☆158Updated this week