cjang / GATLAS
GPU Automatically Tuned Linear Algebra Software
☆28 Updated 10 years ago
Alternatives and similar repositories for GATLAS
Users who are interested in GATLAS are comparing it to the libraries listed below.
- A managed platform and language for GPGPU ☆32 Updated 13 years ago
- Portable 128-bit SIMD intrinsics ☆59 Updated 2 years ago
- A portable high-level API with CUDA or OpenCL back-end ☆55 Updated 8 years ago
- This repository contains components that will support percolation via OpenCL and CUDA ☆32 Updated 3 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group ☆78 Updated 4 years ago
- clang with OpenMP 3.1 and some elements of OpenMP 4.0 support ☆90 Updated 10 years ago
- A library for unconstrained minimization of smooth functions using Newton's method or L-BFGS. ☆37 Updated 7 years ago
- Fast matrix multiplication ☆31 Updated 4 years ago
- C99/C++ header-only library for division via fixed-point multiplication by inverse ☆58 Updated last year
- Scientific library for high-precision computations and research ☆49 Updated 8 years ago
- Accelerator Programming Library in C++ ☆57 Updated 7 years ago
- Flexible Library for Efficient Numerical Solutions ☆127 Updated 5 months ago
- VIGRA2 based on xtensor ☆10 Updated 7 years ago
- Research library for compile time optimization ☆12 Updated 6 years ago
- Python bindings for libNVVM ☆37 Updated 11 years ago
- Sample implementation of a proposed C++ hashing framework ☆29 Updated 10 years ago
- A C/C++ task-based programming model for shared memory and distributed parallel computing. ☆73 Updated 5 years ago
- C++ modelling library for integer programming ☆15 Updated 5 years ago
- Fork of magma to include more BLAS ☆28 Updated 9 years ago
- Vector Math Library ☆84 Updated last month
- A lightweight C++ framework for vectorizing image-processing code ☆76 Updated 8 years ago
- Examples from Second Edition of Discovering Modern C++ ☆21 Updated 7 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU… ☆37 Updated 9 years ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi… ☆68 Updated last year
- A well-documented C++ implementation of the cover tree datastructure for quick k-nearest-neighbor search. Allows single-point insertion a… ☆62 Updated 7 years ago
- Intel(R) Concurrent Collections for C++ ☆116 Updated 2 years ago
- ArrayFire's Machine Learning Library. ☆105 Updated 7 years ago
- A Light-weight and Fast Template Matrix Library ☆133 Updated 12 years ago
- Library to program with streams, events, and to queue own functions into a stream. ☆15 Updated 2 weeks ago
- Automatically Tuned Linear Algebra Software (ATLAS) ☆189 Updated 5 years ago