bassoy / ttv
C++ Header-Only Library for High-Performance Tensor-Vector Multiplication
☆22 Updated 10 months ago
Alternatives and similar repositories for ttv
Users who are interested in ttv are comparing it to the libraries listed below.
- ☆32 Updated 2 months ago
- Portable HPC Containers (C++) ☆48 Updated last month
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda ☆92 Updated last week
- DARMA/vt => Virtual Transport ☆39 Updated last week
- A Low-Level Abstraction of Memory Access ☆92 Updated last year
- DLA-Future ☆80 Updated this week
- Copy-hiding array abstraction to automatically migrate data between memory spaces ☆109 Updated 3 weeks ago
- Department of Energy Standard Utility Library ☆32 Updated 2 weeks ago
- An alternative to Boost.MPI for a user friendly C++ interface for MPI (MPICH). ☆19 Updated 7 years ago
- Autonomic Performance Environment for eXascale (APEX) ☆49 Updated 3 months ago
- Official BOLT Repository ☆31 Updated last year
- An OpenMP runtime implemented using HPX ☆24 Updated 3 years ago
- CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as lo… ☆31 Updated this week
- Collaborating on papers for the ISO C++ committee - public repo ☆27 Updated last year
- DARMA/magistrate => Serialization and checkpointing library ☆12 Updated 3 weeks ago
- Vectorised data model base and helper classes. ☆20 Updated last month
- A C/C++ task-based programming model for shared memory and distributed parallel computing. ☆72 Updated 5 years ago
- associative floating point addition ☆18 Updated last year
- TTG: Template Task Graph C++ API ☆26 Updated 3 months ago
- A fast shared & distributed memory task-based runtime in C++ ☆28 Updated 4 years ago
- Library for length agnostic SIMD intrinsic support and the corresponding math operations ☆21 Updated 3 years ago
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze ☆21 Updated 5 years ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures. ☆52 Updated last month
- pika is a C++ tasking library built on std::execution with fibers, CUDA, HIP, and MPI support. ☆76 Updated this week
- Implementation of AMD HIP for CPUs ☆23 Updated 5 years ago
- Little OpenMP Library ☆168 Updated 3 years ago
- Multi-dimensional C++ arrays which store objects in a Struct-of-Arrays (SoA) memory layout for efficient vectorization and zero address g… ☆36 Updated 5 years ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project. ☆89 Updated last week
- Presentation materials for the 2016 Berkeley C++ Summit ☆14 Updated 9 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit ☆96 Updated 7 months ago