planetchili / ssehandreliefLinks
SSE optimization tutorial code
☆25Updated 5 years ago
Alternatives and similar repositories for ssehandrelief
Users that are interested in ssehandrelief are comparing it to the libraries listed below
Sorting:
- serialization prototype for tiny-dnn☆15Updated 8 years ago
- Implementation of a few sorting algorithms in OpenCL☆35Updated 5 years ago
- Set of basic classes (vector, matrix, images and memory array) for CPU and GPU☆17Updated 4 years ago
- Portable 128-bit SIMD intrinsics☆58Updated last year
- Communication-Minimizing 2D Convolution in GPU Registers☆30Updated 11 years ago
- GPU Automatically Tuned Linear Algebra Software☆28Updated 9 years ago
- Visual Computing Library☆20Updated 2 months ago
- C99/C++ header-only library for division via fixed-point multiplication by inverse☆53Updated last year
- ☆68Updated 2 years ago
- A reference implementation of std::simd, providing data parallel types in the C++ standard☆12Updated 5 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- A Nonlinear Least Squares Minimizer☆35Updated 13 years ago
- FastAC - Amir Said's Arithmetic and Huffman coding library, example code, and documentation☆29Updated 3 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU…☆35Updated 9 years ago
- tokenizer and parser for circle projects☆11Updated 5 years ago
- Input-aware cuBLAS/clBLAS implementation for better performance☆17Updated 2 years ago
- Parallel k-D Tree Construction☆57Updated 13 years ago
- Modern C++ Parallel Task Programming Library☆8Updated 6 years ago
- Header file to translate SSE instructions to ARM NEON instructions☆48Updated 11 years ago
- Simple example of using Vulkan for GPGPU computing☆55Updated 6 years ago
- C++ to OpenCL C Source-to-source Translation☆13Updated 11 years ago
- OpenCL-OpenGL Interop examples☆43Updated 5 years ago
- Simple GLSL compilation checker that uses the display driver☆25Updated 8 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆77Updated 4 years ago
- A class for performing principal component analysis using Eigen library☆30Updated 8 years ago
- A library for unconstrained minimization of smooth functions using Newton's method or L-BFGS.☆36Updated 6 years ago
- Computer Graphics Tools library☆21Updated 8 years ago
- Experimental ranges for CUDA☆24Updated 6 years ago
- CMake module collection☆30Updated 10 years ago
- Simple voxelizer make use of CPU SIMD units☆2Updated last month