NVIDIA / CoMD-CUDA
GPU implementation of classical molecular dynamics proxy application.
☆31 · Updated 8 years ago
Alternatives and similar repositories for CoMD-CUDA
Users who are interested in CoMD-CUDA are comparing it to the repositories listed below.
- Compute applications. ☆25 · Updated 6 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA ☆11 · Updated 7 years ago
- a heterogeneous multiGPU level-3 BLAS library ☆46 · Updated 6 years ago
- Range-based for loops to iterate over a range of numbers or values ☆34 · Updated 9 years ago
- ☆74 · Updated 2 years ago
- HIP back-end for Thrust that has been replaced by rocThrust ☆28 · Updated 2 years ago
- ☆87 · Updated 8 years ago
- Full-speed Array of Structures access ☆176 · Updated 2 years ago
- The SparseX sparse kernel optimization library ☆43 · Updated 6 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037 ☆46 · Updated last year
- Classical molecular dynamics proxy application. ☆32 · Updated 5 years ago
- mirror from http://lotsofcores.com book 2, since dropbox isn't good for everyone ☆38 · Updated 9 years ago
- Launching collective tasks in bulk ☆37 · Updated 6 years ago
- Source code from NVIDIA CUDACasts ☆48 · Updated 11 years ago
- a software library containing Sparse functions written in OpenCL ☆175 · Updated 5 years ago
- Portable and Flexible DGEMM Library for GPUs (OpenCL, CUDA, CAL) with special support for HPL ☆17 · Updated 7 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda ☆95 · Updated 3 weeks ago
- A GPU performance prediction toolkit for CUDA programs ☆18 · Updated 6 years ago
- A mirror of cinch's internal gitlab repository. ☆21 · Updated 3 years ago
- TTC: A high-performance Compiler for Tensor Transpositions ☆21 · Updated 8 years ago
- Next generation library for iterative sparse solvers for ROCm platform ☆90 · Updated 2 weeks ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition ☆48 · Updated 10 years ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces ☆110 · Updated this week
- Autonomic Performance Environment for eXascale (APEX) ☆50 · Updated 5 months ago
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d… ☆110 · Updated 5 months ago
- A C++ allocator based on cudaMallocManaged ☆23 · Updated 7 years ago
- A task benchmark ☆44 · Updated last year
- Examples for using SYCL on CUDA ☆62 · Updated 3 months ago
- ☆16 · Updated last month
- Livermore Big Artificial Neural Network Toolkit ☆229 · Updated 7 months ago