Gram21 / GPUSortingLinks

Implementation of a few sorting algorithms in OpenCL

☆35

Alternatives and similar repositories for GPUSorting

Users that are interested in GPUSorting are comparing it to the libraries listed below

Sorting:

Glavnokoman / vulkan-compute-example
Simple example of using Vulkan for GPGPU computing
☆55Updated 6 years ago
CNugteren / CLCudaAPI
A portable high-level API with CUDA or OpenCL back-end
☆54Updated 7 years ago
halide / visual_debugger
☆28Updated 6 years ago
milakov / int_fastdiv
Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.
☆71Updated 9 years ago
intel / clGPU
☆68Updated 2 years ago
apc-llc / whippletree
Whippletree, a novel approach to scheduling dynamic, irregular workloads on the GPU
☆22Updated 9 years ago
eyalroz / libgiddy
Giddy - A lightweight GPU decompression library
☆42Updated 6 years ago
AnyDSL / traversal
AnyDSL traversal code
☆15Updated 6 years ago
wjakob / dset
Lock-free parallel disjoint set data structure (aka UNION-FIND) with path compression and union by rank
☆66Updated 10 years ago
Lichtso / VulkanFFT
Fast Fourier Transform using the Vulkan API
☆34Updated 4 years ago
cyrillefavreau / Sol-R
Speed of light ray-tracer
☆23Updated 7 years ago
godefv / math
This is a C++ math library, with a focus on geometry.
☆29Updated 4 years ago
Maratyszcza / psimd
Portable 128-bit SIMD intrinsics
☆58Updated 2 years ago
9prady9 / CLGLInterop
OpenCL-OpenGL Interop examples
☆43Updated 5 years ago
bfierz / vcl
Visual Computing Library
☆20Updated 3 months ago
vinjn / GpuProf
Realtime GPU Profiler for AMD / NVIDIA / Intel GPUs
☆32Updated last year
dillonhuff / scg
3D Computational Geometry in C++11
☆20Updated 6 years ago
Xilinx / triSYCL
Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
☆77Updated 4 years ago
pdziepak / ranges-gpu
Experimental ranges for CUDA
☆24Updated 6 years ago
tcoppex / polytri
🔺 Fast and simple polygon triangulation library.
☆43Updated 7 years ago
ashvardanian / ParallelReductionsBenchmark
Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!
☆99Updated last month
Auburn / FastSIMD
Low level generic SIMD wrapper for x86, ARM, WASM with dynamic dispatch
☆38Updated 7 months ago
Maratyszcza / FXdiv
C99/C++ header-only library for division via fixed-point multiplication by inverse
☆54Updated last year
bchoi / ParKD
Parallel k-D Tree Construction
☆57Updated 13 years ago
sergiud / SuiteSparse
SuiteSparse: a suite of sparse matrix packages by @DrTimothyAldenDavis et al. with native CMake support
☆53Updated 3 weeks ago
ProGTX / sycl-gtx
Implementation of the SYCL specification.
☆66Updated last year
mgopshtein / cudacpp
C++ convenience classes to be used with CUDA code, for both the host and the kerlel parts.
☆55Updated 6 years ago
dimitrs / cpp-opencl
C++ to OpenCL C Source-to-source Translation
☆13Updated 11 years ago
ramenhut / image-resampler
A flexible image resampling library
☆43Updated 8 years ago
alpaka-group / mallocMC
mallocMC: Memory Allocator for Many Core Architectures
☆58Updated 2 months ago