bennylp / awesome-cpp-ml
My curated list of C++ (GPU) BLAS libraries and machine learning/reinforcement learning frameworks
☆23 Updated 4 years ago
Related projects:
- ☆56 Updated this week
- Serial and parallel implementations of matrix multiplication ☆34 Updated 3 years ago
- Algorithms implemented in CUDA + resources about GPGPU ☆53 Updated 2 years ago
- Example of how to use CUDA with CMake >= 3.8 ☆69 Updated last year
- A cross-platform CUDA/C++17 starter project with Google Test and Google Benchmark support. ☆35 Updated last year
- Deep Learning With C++ ☆29 Updated 6 years ago
- C++ Neural Network Library ☆12 Updated 4 months ago
- ☆19 Updated 8 years ago
- Examples from the "C++ From Scratch" Series ☆57 Updated last year
- A Low-Level Abstraction of Memory Access ☆79 Updated 6 months ago
- ☆41 Updated 3 months ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆90 Updated 2 years ago
- AMD’s C++ library for accelerating tensor primitives ☆35 Updated this week
- A collection of code examples for learning parallel programming concepts ☆51 Updated 3 years ago
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA ☆78 Updated last month
- CUDA Template Functions ☆18 Updated last month
- GPU implementation of classical molecular dynamics proxy application. ☆29 Updated 7 years ago
- "Hardware, Software, and Compilers! Oh My!" tutorial files ☆15 Updated 4 years ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta… ☆43 Updated 2 years ago
- ☆56 Updated 3 weeks ago
- A collection of awesome algorithms, implemented in CUDA. ☆24 Updated 6 years ago
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020. ☆22 Updated 3 years ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels. ☆70 Updated 8 years ago
- My C++ deep learning framework & other machine learning algorithms ☆77 Updated last year
- Parallel Algorithm Scheduling Library ☆101 Updated 7 years ago
- CMake modules used within the ROCm libraries ☆59 Updated this week
- Learn OpenMP examples step by step ☆81 Updated 3 years ago
- A library of various helper routines and frameworks used by many of the lab's software ☆39 Updated 4 months ago
- C++ convenience classes to be used with CUDA code, for both the host and the kernel parts. ☆55 Updated 5 years ago
- Reference implementation of the draft C++ GraphBLAS specification. ☆27 Updated 7 months ago