cjang / GATLAS
GPU Automatically Tuned Linear Algebra Software
☆28 · Updated 9 years ago
Related projects
Alternatives and complementary repositories for GATLAS
- A managed platform and language for GPGPU ☆32 · Updated 11 years ago
- Accelerator Programming Library in C++ ☆57 · Updated 6 years ago
- C++ heterogeneous and lock-free containers ☆13 · Updated 6 years ago
- Enable Polyhedral JIT compilation ☆9 · Updated 6 years ago
- General Stride K-Nearest Neighbors ☆13 · Updated 3 years ago
- Library to program with streams, events, and to queue own functions into a stream. ☆16 · Updated 4 months ago
- A portable high-level API with CUDA or OpenCL back-end ☆54 · Updated 7 years ago
- TTC: A high-performance Compiler for Tensor Transpositions ☆20 · Updated 7 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group ☆76 · Updated 3 years ago
- This repository contains components that will support percolation via OpenCL and CUDA ☆32 · Updated 2 years ago
- tokenizer and parser for circle projects ☆11 · Updated 5 years ago
- Vectorized intersections (research code) ☆14 · Updated 7 years ago
- C++ Summer Lecture Series 2016 ☆13 · Updated 8 years ago
- BSPLib is a fast, and easy to use C++ implementation of the Bulk Synchronous Parallel (BSP) threading model. ☆20 · Updated 6 years ago
- A mirror of cinch's internal gitlab repository. ☆22 · Updated 2 years ago
- Intel(R) Concurrent Collections for C++ ☆115 · Updated last year
- Sample implementation of a proposed C++ hashing framework ☆29 · Updated 9 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU… ☆35 · Updated 8 years ago
- Library for exact linear algebra, a C++ template-library based originally on LinBox intended for F4-like implementations ☆16 · Updated 11 years ago
- A Halide journey taken for pleasure, this repo will hopefully serve a collection of Halide imaging functions that are useful to the commu… ☆15 · Updated 9 years ago
- Non-blocking message passing (a C++14 MPI wrapper) ☆18 · Updated 10 years ago
- Distributed machine learning platform ☆12 · Updated 9 years ago
- clang with OpenMP 3.1 and some elements of OpenMP 4.0 support ☆91 · Updated 9 years ago
- A library for unconstrained minimization of smooth functions using Newton's method or L-BFGS. ☆35 · Updated 6 years ago
- Generate and execute native code at run time, from Python ☆51 · Updated 3 months ago
- MCMC for the Dark Energy Spectroscopic Instrument ☆13 · Updated 8 years ago
- Programming Accelerators with C++ (PACXX) ☆58 · Updated 6 years ago
- finding set bits in large bitmaps ☆15 · Updated 8 years ago
- C++Now 2016 talk - Pulling Visitors: Boost.Graph + Boost.Coroutine ☆9 · Updated 8 years ago