cjang / GATLASLinks
GPU Automatically Tuned Linear Algebra Software
☆28Updated 10 years ago
Alternatives and similar repositories for GATLAS
Users that are interested in GATLAS are comparing it to the libraries listed below
Sorting:
- A managed platform and language for GPGPU☆32Updated 13 years ago
- Portable 128-bit SIMD intrinsics☆59Updated 2 years ago
- A portable high-level API with CUDA or OpenCL back-end☆56Updated 8 years ago
- This repository contains components that will support percolation via OpenCL and CUDA☆32Updated 4 years ago
- Scientific library for high-precision computations and research☆49Updated 8 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆78Updated 5 years ago
- C99/C++ header-only library for division via fixed-point multiplication by inverse☆59Updated last year
- Accelerator Programming Library in C++☆56Updated 7 years ago
- clang with OpenMP 3.1 and some elements of OpenMP 4.0 support☆91Updated 10 years ago
- Flexible Library for Efficient Numerical Solutions☆127Updated 8 months ago
- Sample implementation of a proposed C++ hashing framework☆29Updated 10 years ago
- Programming Accelerators with C++ (PACXX)☆58Updated 7 years ago
- C++ modelling library for integer programming☆15Updated 5 years ago
- A library for unconstrained minimization of smooth functions using Newton's method or L-BFGS.☆38Updated 7 years ago
- Python bindings for libNVVM☆38Updated 11 years ago
- Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project☆26Updated 6 years ago
- Fast matrix multiplication☆31Updated 4 years ago
- Intel(R) Concurrent Collections for C++☆116Updated 3 years ago
- A lightweight C++ framework for vectorizing image-processing code☆76Updated 8 years ago
- A Light-weight and Fast Template Matrix Library☆132Updated 12 years ago
- CMake Examples (CMake, CMake+CUDA, CMake+CUDA+PandaRoot)☆42Updated 12 years ago
- VIGRA2 based on xtensor☆10Updated 7 years ago
- A Nonlinear Least Squares Minimizer☆35Updated 13 years ago
- Generalized Histograms for CUDA-capable GPUs☆43Updated 10 years ago
- A C/C++ task-based programming model for shared memory and distributed parallel computing.☆72Updated 5 years ago
- Research library for compile time optimization☆12Updated 7 years ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆68Updated last year
- UME::SIMD A library for explicit simd vectorization.☆91Updated 8 years ago
- A computational geometry library for C++ and Python☆86Updated 4 years ago
- Shader-Like Mathematical Expression JIT Engine for C++ Language☆60Updated 6 years ago