intel / HFAV
☆14Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for HFAV
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- C++ User interface for the Platform independent Library Alpaka☆37Updated 3 months ago
- Multi-dimensional array programming framework for C++ and multi-GPU CUDA applications☆28Updated 7 years ago
- GPU Optimization and Memory Abstraction Framework☆32Updated 5 years ago
- IPython / Jupyter integration for pybind11☆66Updated 7 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- sparse matrix pre-processing library☆81Updated 6 months ago
- a tester for BLAS libraries including OpenBLAS and Intel MKL. This project is based on ATLAS BLAS Tester☆33Updated last year
- Autonomic Performance Environment for eXascale (APEX)☆38Updated 3 weeks ago
- data-parallel out-of-core library☆48Updated this week
- Kernel Tuning Toolkit☆55Updated 3 weeks ago
- ☆23Updated 5 years ago
- 3D Tensors for Blaze (https://bitbucket.org/blaze-lib/blaze)☆36Updated 4 years ago
- Multi-dimensional C++ arrays which store objects in a Struct-of-Arrays (SoA) memory layout for efficient vectorization and zero address g…☆36Updated 4 years ago
- A library for C++/Fortran computer simulations (e.g. stencil codes, mesh-free, unstructured grids, n-body & particle methods). Scales fro…☆38Updated 3 years ago
- CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as lo…☆29Updated this week
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆45Updated 9 years ago
- A library to benchmark CUDA code, similar to google benchmark.☆28Updated 3 years ago
- Range-based for loops to iterate over a range of numbers or values☆35Updated 8 years ago
- MPI wrapper generator, for writing PMPI tool libraries☆34Updated 2 years ago
- SOSflow : Scalable Observation System for Scientific Workflows☆12Updated 4 years ago
- A unified framework across multiple programming platforms☆33Updated 5 months ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆106Updated this week
- This repository contains components that will support percolation via OpenCL and CUDA☆32Updated 2 years ago
- An OpenMP runtime implemented using HPX☆23Updated 2 years ago
- associative floating point addition☆17Updated 6 months ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆80Updated this week
- CUDA kernel author's tools☆109Updated 2 years ago
- Recursive LAPACK Collection☆42Updated 2 years ago