pdziepak / ranges-gpu
Experimental ranges for CUDA
☆24Updated 6 years ago
Alternatives and similar repositories for ranges-gpu
Users that are interested in ranges-gpu are comparing it to the libraries listed below
Sorting:
- low level system stuff, threads, memory, file descriptors etc☆21Updated 3 months ago
- A fast implementation of log() and exp()☆53Updated 2 years ago
- Library for length agnostic SIMD intrinsic support and the corresponding math operations☆20Updated 3 years ago
- Polymorphic multidimensional array view☆36Updated 4 years ago
- Seamless llvm-mca CMake integration☆27Updated 5 years ago
- Runtime choosing of template specializations using compile-time lookup-tables. Compile all states of a template function, but execute the…☆26Updated last year
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal - all it takes to sum a lot of numbers fast!☆96Updated last week
- C++20 fiber implementation with similar interface to std::thread, header-only / x86_64 / Linux only / stackful / built-in scheduler / thr…☆27Updated 7 months ago
- async is a tiny C++ header-only high-performance library for async calls handled by a thread-pool, which is built on top of an unbounded …☆29Updated 4 years ago
- Vectorization EDSL library☆15Updated 5 years ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- ☆26Updated 5 years ago
- A vectorizable multi-dimensional iterator for C++ using the Coroutines TS☆12Updated 2 years ago
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line☆23Updated 3 weeks ago
- UME::SIMD A library for explicit simd vectorization.☆90Updated 7 years ago
- Dynamic loading examples outside CppCon 2021☆16Updated 3 years ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations☆45Updated 5 years ago
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze☆18Updated 5 years ago
- Collaborating on papers for the ISO C++ committee - public repo☆26Updated 9 months ago
- Multi-dimensional C++ arrays which store objects in a Struct-of-Arrays (SoA) memory layout for efficient vectorization and zero address g…☆36Updated 4 years ago
- C library plusifier☆10Updated 3 years ago
- ☆70Updated 4 years ago
- a CUDA implementation of a priority queue☆84Updated 4 years ago
- mallocMC: Memory Allocator for Many Core Architectures☆55Updated this week
- Interchangeable AoS and SoA containers☆23Updated 2 years ago
- Polymorphic memory resource for real-time applications.☆67Updated last year
- CUDA executors☆14Updated 4 years ago
- C++11 Header-only continuous-storage Double ended vector implementation similar to STL's std::vector for efficient insertions/removals at…☆14Updated 2 years ago
- Adobe's C++ Performance Benchmarks for modern compilers (and build systems)☆12Updated 5 years ago
- "Small Vector" optimization for Modern C++: store up to a small number of items on the stack☆34Updated 4 years ago