jaredhoberock / cudexLinks
CUDA executors
☆14Updated 5 years ago
Alternatives and similar repositories for cudex
Users that are interested in cudex are comparing it to the libraries listed below
Sorting:
- A reference implementation of std::simd, providing data parallel types in the C++ standard☆13Updated 5 years ago
- Collection of ktl header libraries - small C++20 wrapper utilities☆11Updated 3 years ago
- Library for length agnostic SIMD intrinsic support and the corresponding math operations☆21Updated 4 years ago
- Runtime choosing of template specializations using compile-time lookup-tables. Compile all states of a template function, but execute the…☆26Updated 3 weeks ago
- Pre-conditions, post-conditions, and assertions, all available at compile-time☆26Updated 3 years ago
- An alternative to Boost.MPI for a user friendly C++ interface for MPI (MPICH).☆19Updated 7 years ago
- DARMA/vt => Virtual Transport☆38Updated last week
- Experimental ranges for CUDA☆25Updated 6 years ago
- An OpenMP runtime implemented using HPX☆24Updated 3 years ago
- DARMA/magistrate => Serialization and checkpointing library☆12Updated 2 weeks ago
- A simple C++17 vector math library☆10Updated 4 years ago
- Collaborating on papers for the ISO C++ committee - public repo☆27Updated 2 months ago
- C++20 and onward collection of high performance data containers and related tools☆57Updated last week
- Implementation of AMD HIP for CPUs☆22Updated 5 years ago
- Resource-based, Declarative task-Graphs for Parallel, Event-driven Scheduling☆26Updated last year
- A plotting tool☆32Updated 5 years ago
- A ring_span implementation that allows zero construction and destruction☆16Updated 5 years ago
- Object-oriented extension to the CMake language.☆13Updated 7 months ago
- Volume Manipulation Library☆17Updated 2 years ago
- A Low-Level Abstraction of Memory Access☆93Updated last year
- async is a tiny C++ header-only high-performance library for async calls handled by a thread-pool, which is built on top of an unbounded …☆32Updated 5 years ago
- ☆11Updated 3 years ago
- CMake module to optimize cflags for architecture extensions such as SSE, AVX☆27Updated 10 months ago
- 🔎 Have your bits and eat them too! A C++17 bit lens container for vector types.☆22Updated 5 years ago
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line☆24Updated 2 months ago
- CPP20 implementation of a 16-bit floating-point type mimicking most of the IEEE 754 behavior. Single file and header-only.☆44Updated last month
- Compiler-assisted variable size benchmarking for the study of C++ metaprogram compile times.☆26Updated last year
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze☆21Updated 6 years ago
- C++11 Header-only continuous-storage Double ended vector implementation similar to STL's std::vector for efficient insertions/removals at…☆16Updated 3 years ago
- C++ data structure to manipulate SoA (Structure of Arrays) easily (previously named AggregateVector)☆49Updated 7 years ago