fynv / ThrustRTCLinks
CUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.
☆59Updated 3 years ago
Alternatives and similar repositories for ThrustRTC
Users that are interested in ThrustRTC are comparing it to the libraries listed below
Sorting:
- Generate simple index ranges in C++ and CUDA C++☆39Updated 2 years ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆95Updated 3 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆73Updated 2 years ago
- Examples for using SYCL on CUDA☆62Updated 2 months ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆112Updated 2 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated last year
- DLA-Future☆80Updated this week
- A library of various helper routines and frameworks used by many of the lab's software☆70Updated 3 months ago
- Source code examples from the Parallel Forall Blog☆96Updated 6 years ago
- CUDA kernel author's tools☆113Updated 3 years ago
- Full-speed Array of Structures access☆174Updated 2 years ago
- MagmaDNN: a simple deep learning framework in c++☆50Updated 5 years ago
- Kernel Tuning Toolkit☆65Updated last week
- Next generation library for iterative sparse solvers for ROCm platform☆89Updated this week
- Exploring using stdpar and Cython☆34Updated 4 years ago
- Implementation of AMD HIP for CPUs☆23Updated 5 years ago
- A task benchmark☆44Updated last year
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆84Updated last year
- Multi-dimensional array programming framework for C++ and multi-GPU CUDA applications☆28Updated 8 years ago
- Sympiler is a Code Generator for Transforming Sparse Matrix Codes☆43Updated 2 years ago
- Concurrent CPU-GPU Programming using Task Models☆104Updated 5 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆92Updated last week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 9 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆210Updated this week
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020.☆22Updated 5 years ago
- A pseudo random number generator library written against the SYCL API.☆11Updated 6 years ago
- Library for length agnostic SIMD intrinsic support and the corresponding math operations☆21Updated 3 years ago
- ☆19Updated 6 years ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆89Updated last week
- ☆23Updated 3 years ago