jeremad / cuda-travisLinks
☆19Updated 5 years ago
Alternatives and similar repositories for cuda-travis
Users that are interested in cuda-travis are comparing it to the libraries listed below
Sorting:
- Full-speed Array of Structures access☆171Updated 2 years ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated 2 years ago
- CUDA kernel author's tools☆111Updated 3 years ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- Training examples for SYCL☆43Updated 2 months ago
- A library to benchmark CUDA code, similar to google benchmark.☆29Updated 4 years ago
- The C++ Standard Library for your entire system.☆18Updated 2 months ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 6 months ago
- RAJA Performance Suite☆118Updated this week
- Subset of BLAS routines optimized for NVIDIA GPUs☆71Updated 2 years ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆546Updated last month
- A GPU accelerated error-bounded lossy compression for scientific data.☆86Updated last month
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 3 months ago
- Distributed View Extension for Kokkos☆47Updated 7 months ago
- Use CUDA intrinsics with user-defined types☆47Updated 10 years ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆62Updated last year
- Advanced Profiling and Analytics for AMD Hardware☆159Updated this week
- Implementation of AMD HIP for CPUs☆22Updated 5 years ago
- Error-bounded Lossy Data Compressor (for floating-point/integer datasets)☆165Updated last year
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template…☆361Updated 11 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆172Updated this week
- STREAM, for lots of devices written in many programming models☆345Updated 10 months ago
- ☆30Updated last week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆109Updated 2 years ago
- Range-based for loops to iterate over a range of numbers or values☆35Updated 8 years ago
- A task benchmark☆43Updated 11 months ago
- SYCL Open Source Specification☆136Updated this week
- Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.☆348Updated 3 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated last year
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆130Updated 3 weeks ago