cjang / GATLAS
GPU Automatically Tuned Linear Algebra Software
☆28 Updated 9 years ago
Related projects
Alternatives and complementary repositories for GATLAS
- A managed platform and language for GPGPU ☆32 Updated 11 years ago
- Accelerator Programming Library in C++ ☆57 Updated 6 years ago
- Library to program with streams, events, and to queue own functions into a stream. ☆16 Updated 4 months ago
- C++ heterogeneous and lock-free containers ☆13 Updated 6 years ago
- Vectorized intersections (research code) ☆14 Updated 7 years ago
- Enable Polyhedral JIT compilation ☆9 Updated 6 years ago
- This repository contains components that will support percolation via OpenCL and CUDA ☆32 Updated 2 years ago
- finding set bits in large bitmaps ☆15 Updated 8 years ago
- CMake module collection ☆30 Updated 9 years ago
- A portable high-level API with CUDA or OpenCL back-end ☆54 Updated 7 years ago
- tokenizer and parser for circle projects ☆11 Updated 5 years ago
- Research library for compile time optimization ☆12 Updated 5 years ago
- TTC: A high-performance Compiler for Tensor Transpositions ☆20 Updated 7 years ago
- General Stride K-Nearest Neighbors ☆13 Updated 3 years ago
- Input-aware cuBLAS/clBLAS implementation for better performance ☆17 Updated 2 years ago
- Communication-Minimizing 2D Convolution in GPU Registers ☆30 Updated 11 years ago
- Scientific library for high-precision computations and research ☆50 Updated 7 years ago
- Generate and execute native code at run time, from Python ☆51 Updated 4 months ago
- A mirror of cinch's internal gitlab repository. ☆22 Updated 2 years ago
- Sample implementation of a proposed C++ hashing framework ☆29 Updated 9 years ago
- C++Now 2016 talk - Pulling Visitors: Boost.Graph + Boost.Coroutine ☆9 Updated 8 years ago
- Programming Accelerators with C++ (PACXX) ☆58 Updated 6 years ago
- clang with OpenMP 3.1 and some elements of OpenMP 4.0 support ☆91 Updated 9 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU… ☆35 Updated 8 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group ☆76 Updated 3 years ago
- DSL for stencils and image processing ☆13 Updated 8 years ago
- Python bindings for libNVVM ☆37 Updated 10 years ago
- Intel Heterogeneous Research Compiler (iHRC) ☆25 Updated last year
- C++ library for numerical arrays and tensor objects and operations with them, designed to allow Matlab-style programming. ☆51 Updated last year
- Non-blocking message passing (a C++14 MPI wrapper) ☆18 Updated 10 years ago