bassoy / ttvLinks
C++ Header-Only Library for High-Performance Tensor-Vector Multiplication
☆22Updated 9 months ago
Alternatives and similar repositories for ttv
Users that are interested in ttv are comparing it to the libraries listed below
Sorting:
- ☆31Updated last month
- DARMA/vt => Virtual Transport☆38Updated last week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆90Updated 3 weeks ago
- DLA-Future☆78Updated this week
- Portable HPC Containers (C++)☆48Updated 3 weeks ago
- Autonomic Performance Environment for eXascale (APEX)☆49Updated 2 months ago
- Collaborating on papers for the ISO C++ committee - public repo☆27Updated last year
- A Low-Level Abstraction of Memory Access☆88Updated last year
- C++ Library for Object-oriented Programming with Structure of Arrays Layout☆21Updated 7 years ago
- An alternative to Boost.MPI for a user friendly C++ interface for MPI (MPICH).☆19Updated 7 years ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆109Updated last week
- DARMA/magistrate => Serialization and checkpointing library☆12Updated last week
- Vectorised data model base and helper classes.☆20Updated last week
- C++ User interface for the Platform independent Library Alpaka☆39Updated last year
- Department of Energy Standard Utility Library☆32Updated this week
- TTG: Template Task Graph C++ API☆26Updated 3 months ago
- A C/C++ task-based programming model for shared memory and distributed parallel computing.☆72Updated 5 years ago
- Official BOLT Repository☆31Updated last year
- An OpenMP runtime implemented using HPX☆24Updated 3 years ago
- CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as lo…☆31Updated last week
- Library for length agnostic SIMD intrinsic support and the corresponding math operations☆21Updated 3 years ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆52Updated 2 weeks ago
- A fast shared & distributed memory task-based runtime in C++☆28Updated 4 years ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated 2 years ago
- Global Memory and Threading runtime system☆25Updated last year
- Implementation of AMD HIP for CPUs☆23Updated 5 years ago
- pika is a C++ tasking library built on std::execution with fibers, CUDA, HIP, and MPI support.☆74Updated last week
- Concurrent CPU-GPU Programming using Task Models☆103Updated 5 years ago
- Little OpenMP Library☆167Updated 3 years ago
- List all available information about all SYCL devices and platforms☆15Updated 5 years ago