jeremad / cuda-travisLinks
☆19Updated 5 years ago
Alternatives and similar repositories for cuda-travis
Users that are interested in cuda-travis are comparing it to the libraries listed below
Sorting:
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- ☆29Updated this week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 2 months ago
- The C++ Standard Library for your entire system.☆18Updated last month
- Training examples for SYCL☆42Updated last month
- CUDA kernel author's tools☆111Updated 3 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆105Updated 7 years ago
- Full-speed Array of Structures access☆169Updated 2 years ago
- Online CUDA Occupancy Calculator☆76Updated 3 years ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆46Updated 10 years ago
- Sympiler is a Code Generator for Transforming Sparse Matrix Codes☆43Updated last year
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.☆8Updated 3 months ago
- HTML/JS port of CUDA Occupancy Calculator☆17Updated 3 years ago
- Distributed Performance-portable Stencil Compuitation☆10Updated last year
- Distributed View Extension for Kokkos☆46Updated 6 months ago
- Implementation of AMD HIP for CPUs☆22Updated 4 years ago
- ☆18Updated last year
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 3 years ago
- A task benchmark☆42Updated 10 months ago
- A library for C++/Fortran computer simulations (e.g. stencil codes, mesh-free, unstructured grids, n-body & particle methods). Scales fro…☆40Updated 4 years ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆62Updated 11 months ago
- ☆29Updated 5 years ago
- Autonomic Performance Environment for eXascale (APEX)☆48Updated 3 weeks ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆47Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- CUDA Dynamic Memory Allocator for SOA Data Layout☆35Updated 3 years ago
- Range-based for loops to iterate over a range of numbers or values☆35Updated 8 years ago
- RAJA Performance Suite☆117Updated this week
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆40Updated 5 months ago
- Comb is a communication performance benchmarking tool.☆25Updated 2 years ago