jaredhoberock / thrust-workshopLinks
Introductory Thrust workshop materials
☆44Updated 12 years ago
Alternatives and similar repositories for thrust-workshop
Users that are interested in thrust-workshop are comparing it to the libraries listed below
Sorting:
- Execution primitives for C++☆154Updated 5 years ago
- CUSP : A C++ Templated Sparse Matrix Library☆419Updated 4 months ago
- Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.☆348Updated 3 years ago
- Launching collective tasks in bulk☆37Updated 6 years ago
- Range-based for loops to iterate over a range of numbers or values☆34Updated 9 years ago
- Full-speed Array of Structures access☆176Updated 2 years ago
- a software library containing Sparse functions written in OpenCL☆175Updated 5 years ago
- UME::SIMD A library for explicit simd vectorization.☆91Updated 7 years ago
- a CUDA implementation of a priority queue☆84Updated 5 years ago
- Multi-dimensional C++ arrays which store objects in a Struct-of-Arrays (SoA) memory layout for efficient vectorization and zero address g…☆36Updated 5 years ago
- CUDA kernel author's tools☆114Updated 3 years ago
- Set of guidelines for porting OpenCL™ C to OpenCL C++☆41Updated 8 years ago
- CMake find module for Intel Threading Building Blocks☆90Updated 7 years ago
- GPU implementation of classical molecular dynamics proxy application.☆31Updated 8 years ago
- Source code examples from the Parallel Forall Blog☆96Updated 6 years ago
- CUDA Data Parallel Primitives Library☆437Updated 7 years ago
- Developer repository for ViennaCL. Visit http://viennacl.sourceforge.net/ for the latest releases.☆293Updated 4 years ago
- Use CUDA intrinsics with user-defined types☆48Updated 11 years ago
- ☆33Updated 3 months ago
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation☆325Updated 2 years ago
- Example of how to use CUDA with CMake >= 3.8☆70Updated 6 months ago
- mallocMC: Memory Allocator for Many Core Architectures☆58Updated 2 weeks ago
- A machine vision library written in SYCL and C++ that shows performance-portable implementation of graph algorithms☆162Updated last year
- K-d tree implementation in C++☆58Updated 13 years ago
- Autonomic Performance Environment for eXascale (APEX)☆49Updated 4 months ago
- C++ convenience classes to be used with CUDA code, for both the host and the kerlel parts.☆55Updated 7 years ago
- CMake Examples (CMake, CMake+CUDA, CMake+CUDA+PandaRoot)☆42Updated 12 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆182Updated 3 years ago
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020.☆22Updated 5 years ago
- Concurrent CPU-GPU Programming using Task Models☆105Updated 5 years ago