Gram21 / GPUSortingLinks
Implementation of a few sorting algorithms in OpenCL
☆35Updated 5 years ago
Alternatives and similar repositories for GPUSorting
Users that are interested in GPUSorting are comparing it to the libraries listed below
Sorting:
- Simple example of using Vulkan for GPGPU computing☆54Updated 6 years ago
- ☆68Updated 2 years ago
- Implementation of the SYCL specification.☆66Updated 11 months ago
- OpenCL for Visual Studio Code☆43Updated 9 months ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆77Updated 4 years ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.☆70Updated 9 years ago
- Experimental ranges for CUDA☆24Updated 6 years ago
- c++ posit implementation☆44Updated last year
- Speed of light ray-tracer☆23Updated 7 years ago
- Realtime GPU Profiler for AMD / NVIDIA / Intel GPUs☆32Updated last year
- Visual Computing Library☆20Updated last month
- Set of guidelines for porting OpenCL™ C to OpenCL C++☆41Updated 8 years ago
- ☆28Updated 6 years ago
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!☆98Updated last week
- Giddy - A lightweight GPU decompression library☆42Updated 5 years ago
- AnyDSL traversal code☆15Updated 6 years ago
- mallocMC: Memory Allocator for Many Core Architectures☆55Updated 3 weeks ago
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated last year
- C++ convenience classes to be used with CUDA code, for both the host and the kerlel parts.☆55Updated 6 years ago
- CUDA Extension Wrangler☆24Updated 5 years ago
- The OpenCL Extension Wrangler Library☆82Updated 8 years ago
- Whippletree, a novel approach to scheduling dynamic, irregular workloads on the GPU☆21Updated 9 years ago
- A machine vision library written in SYCL and C++ that shows performance-portable implementation of graph algorithms☆161Updated last year
- 🔺 Fast and simple polygon triangulation library.☆43Updated 7 years ago
- Minimal OpenCL program on Windows☆20Updated last year
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- ☆70Updated 4 years ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆68Updated last year
- Learn OpenCL step by step.☆135Updated 2 years ago