bassoy / ttvLinks
C++ Header-Only Library for High-Performance Tensor-Vector Multiplication
☆23Updated 2 months ago
Alternatives and similar repositories for ttv
Users that are interested in ttv are comparing it to the libraries listed below
Sorting:
- ☆33Updated 5 months ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆97Updated last month
- DARMA/magistrate => Serialization and checkpointing library☆12Updated last week
- An alternative to Boost.MPI for a user friendly C++ interface for MPI (MPICH).☆19Updated 7 years ago
- DARMA/vt => Virtual Transport☆38Updated last week
- DLA-Future☆82Updated 2 months ago
- Autonomic Performance Environment for eXascale (APEX)☆50Updated 6 months ago
- A Low-Level Abstraction of Memory Access☆93Updated last year
- Global Memory and Threading runtime system☆24Updated last month
- Portable HPC Containers (C++)☆49Updated 2 weeks ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆110Updated last week
- Collaborating on papers for the ISO C++ committee - public repo☆27Updated 2 months ago
- Department of Energy Standard Utility Library☆33Updated last week
- TTG: Template Task Graph C++ API☆26Updated 2 months ago
- CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as lo…☆31Updated last week
- Official BOLT Repository☆31Updated last year
- C++ User interface for the Platform independent Library Alpaka☆39Updated last month
- Vectorised data model base and helper classes.☆20Updated 3 weeks ago
- Reference implementation of the draft C++ GraphBLAS specification.☆32Updated 11 months ago
- Library for length agnostic SIMD intrinsic support and the corresponding math operations☆21Updated 4 years ago
- A C/C++ task-based programming model for shared memory and distributed parallel computing.☆72Updated 5 years ago
- An OpenMP runtime implemented using HPX☆24Updated 3 years ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆51Updated 4 months ago
- Concurrent CPU-GPU Programming using Task Models☆105Updated 6 years ago
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze☆21Updated 6 years ago
- Implementation of AMD HIP for CPUs☆22Updated 5 years ago
- Scalable High-performance Algorithms and Data-structures☆135Updated last month
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆92Updated 3 months ago
- associative floating point addition☆19Updated last year
- Little OpenMP Library☆169Updated 3 years ago