OpenMathLib / OpenVML
Vector Math Library
☆78Updated 8 years ago
Alternatives and similar repositories for OpenVML:
Users that are interested in OpenVML are comparing it to the libraries listed below
- sparse matrix pre-processing library☆82Updated 10 months ago
- Fork of magma to include more BLAS☆28Updated 8 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- Fast matrix multiplication☆29Updated 3 years ago
- ulmBLAS☆105Updated 2 years ago
- Recursive LAPACK Collection☆42Updated 3 years ago
- a tester for BLAS libraries including OpenBLAS and Intel MKL. This project is based on ATLAS BLAS Tester☆34Updated 2 years ago
- C++ library for numerical arrays and tensor objects and operations with them, designed to allow Matlab-style programming.☆52Updated last year
- A domain-specific language and compiler for image processing☆76Updated 4 years ago
- CNNs in Halide☆23Updated 9 years ago
- ☆75Updated last year
- Library for fast image convolution in neural networks on Intel Architecture☆29Updated 7 years ago
- High-Performance Tensor Transpose library☆190Updated last year
- Experimental Linear Algebra Performance Studies☆12Updated 8 years ago
- BLAS OpenCL implementation.☆15Updated 9 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- Flexible Library for Efficient Numerical Solutions☆127Updated 3 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆177Updated 2 years ago
- A Light-weight and Fast Template Matrix Library☆132Updated 12 years ago
- Implementation of the SYCL specification.☆66Updated 9 months ago
- A fast and highly scalable GPU dynamic memory allocator☆104Updated 10 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU…☆35Updated 9 years ago
- Kernel Tuning Toolkit☆59Updated this week
- Full-speed Array of Structures access☆164Updated last year
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆76Updated 4 years ago
- Symbolic Expression and Statement Module for new DSLs☆205Updated 4 years ago
- GPU Automatically Tuned Linear Algebra Software☆28Updated 9 years ago
- Portable 128-bit SIMD intrinsics☆58Updated last year