Gram21 / GPUSorting
Implementation of a few sorting algorithms in OpenCL
☆33Updated 5 years ago
Alternatives and similar repositories for GPUSorting:
Users that are interested in GPUSorting are comparing it to the libraries listed below
- mallocMC: Memory Allocator for Many Core Architectures☆53Updated last week
- Simple example of using Vulkan for GPGPU computing☆53Updated 6 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- Whippletree, a novel approach to scheduling dynamic, irregular workloads on the GPU☆21Updated 9 years ago
- Generic SIMD intrinsic to allow for portable SIMD intrinsic programming☆42Updated 10 years ago
- Implementation of the SYCL specification.☆67Updated 7 months ago
- Compute morton keys using a look-up table generated at compile-time.☆31Updated 8 years ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆66Updated 11 months ago
- ☆67Updated 2 years ago
- Giddy - A lightweight GPU decompression library☆42Updated 5 years ago
- Experimental ranges for CUDA☆25Updated 5 years ago
- Lock-free parallel disjoint set data structure (aka UNION-FIND) with path compression and union by rank☆63Updated 9 years ago
- Polyfill some holes in the SSE intrinsics set☆50Updated 2 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆76Updated 4 years ago
- AnyDSL traversal code☆15Updated 5 years ago
- GLSL like minimalist vector, matrix and quaternion math library for C++11☆39Updated 3 years ago
- Shader-Like Mathematical Expression JIT Engine for C++ Language☆58Updated 5 years ago
- Kernel Tuning Toolkit☆56Updated 3 months ago
- ☆75Updated last year
- ☆26Updated 6 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆109Updated 8 months ago
- vectorized high-level math library☆43Updated 5 years ago
- Set of guidelines for porting OpenCL™ C to OpenCL C++☆40Updated 7 years ago
- A library to benchmark CUDA code, similar to google benchmark.☆28Updated 3 years ago
- ☆56Updated 3 weeks ago
- Half precision floating point C++ library (imported from sourceforge upstream).☆34Updated 7 years ago
- C++ convenience classes to be used with CUDA code, for both the host and the kerlel parts.☆55Updated 6 years ago
- ☆68Updated 4 years ago
- Automatically exported from code.google.com/p/freeocl☆31Updated 7 years ago
- an assembler/compiler for AMD’s GCN (Generation Core Next Architecture) Assembly Language☆39Updated 2 years ago