AerialMantis / cppcon-parallelism-class
Exercises for CppCon 2018 class on parallelism
☆12Updated 5 years ago
Alternatives and similar repositories for cppcon-parallelism-class:
Users that are interested in cppcon-parallelism-class are comparing it to the libraries listed below
- ☆41Updated 6 years ago
- Example of how to use CUDA with CMake >= 3.8☆69Updated last year
- a CUDA implementation of a priority queue☆83Updated 4 years ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆54Updated 2 years ago
- Execution primitives for C++☆154Updated 4 years ago
- C++ implementation of concurrent Binary Search Trees☆71Updated 9 years ago
- GPU Optimization and Memory Abstraction Framework☆32Updated 5 years ago
- Introductory Thrust workshop materials☆43Updated 11 years ago
- CMake module collection☆30Updated 9 years ago
- Compile-time-efficient proof-of-concept implementation for std::tuple☆92Updated 3 years ago
- C++ convenience classes to be used with CUDA code, for both the host and the kerlel parts.☆55Updated 6 years ago
- CMake find module for Intel Threading Building Blocks☆90Updated 6 years ago
- A C++ 14 implementation of graph data structures☆36Updated 8 years ago
- Slides and sample code from presentations at our meetup.☆64Updated 4 years ago
- Implementation of n3554, a proposal to include parallelized versions of the STL algorithms into the C++ standard.☆25Updated 8 years ago
- Seamless llvm-mca CMake integration☆26Updated 4 years ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆65Updated 5 years ago
- LTPV: Light Temporal Performance Viewer☆22Updated 6 years ago
- ☆75Updated last year
- Trie is a lightweight and simple autocompletion data structure written in C++11.☆44Updated 9 years ago
- Communication-Minimizing 2D Convolution in GPU Registers☆30Updated 11 years ago
- UME::SIMD A library for explicit simd vectorization.☆91Updated 7 years ago
- A cross-platform CUDA/C++17 starter project with google test and google benchmark support.☆37Updated last year
- A Light-weight and Fast Template Matrix Library☆130Updated 11 years ago
- Full-speed Array of Structures access☆164Updated last year
- Launching collective tasks in bulk☆37Updated 5 years ago
- ☆172Updated 6 years ago
- Range-based for loops to iterate over a range of numbers or values☆35Updated 8 years ago
- Experimental ranges for CUDA☆25Updated 5 years ago
- ☆68Updated 4 years ago