hgomersall / SSE-convolution
A demonstration of speeding up a 1D convolution using SSE
☆51Updated 8 years ago
Alternatives and similar repositories for SSE-convolution:
Users that are interested in SSE-convolution are comparing it to the libraries listed below
- Some C++ codes for computing a 1D and 2D convolution product using the FFT implemented with the GSL or FFTW☆58Updated 11 years ago
- Vectorizable implementations of some mathematical functions☆103Updated 5 years ago
- Template based C++11 FFT implementation.☆53Updated 10 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- Flexible Library for Efficient Numerical Solutions☆127Updated 3 years ago
- A matrix and array operation library on GPU with Eigen compatible interface☆98Updated 7 years ago
- FFT (Fast Fourier Transform): SSE, AVX, AVX2☆51Updated 8 years ago
- fast log and exp functions for AVX2/AVX-512☆230Updated last month
- A class for performing principal component analysis using Eigen library☆30Updated 8 years ago
- ☆68Updated 2 years ago
- Automatically exported from code.google.com/p/math-neon☆40Updated 10 years ago
- A machine vision library written in SYCL and C++ that shows performance-portable implementation of graph algorithms☆161Updated last year
- Code samples☆64Updated 3 months ago
- a software library containing Sparse functions written in OpenCL☆174Updated 5 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆29Updated 7 years ago
- Implementation of the SYCL specification.☆66Updated 10 months ago
- Blazing-fast Expression Templates Library (ETL) with GPU support, in C++☆224Updated last week
- UME::SIMD A library for explicit simd vectorization.☆90Updated 7 years ago
- Header file to translate SSE instructions to ARM NEON instructions☆48Updated 11 years ago
- Mirror of the Cephes C source for reference☆92Updated last year
- CMake Examples (CMake, CMake+CUDA, CMake+CUDA+PandaRoot)☆41Updated 11 years ago
- an OpenCL based software library containing random number generation functions☆136Updated 3 years ago
- Launching collective tasks in bulk☆37Updated 5 years ago
- CMake module collection☆30Updated 10 years ago
- A C++ allocator based on cudaMallocManaged☆23Updated 6 years ago
- Full-speed Array of Structures access☆169Updated 2 years ago
- The OpenCL Extension Wrangler Library☆82Updated 8 years ago
- repository for slides and code examples for my MeetingCpp talk in 2015☆12Updated 9 years ago
- Vector Math Library☆79Updated 8 years ago
- choosing FFT library...☆149Updated 2 years ago