frozenca / Ndim-Matrix
C++20 N-dimensional Matrix class for hobby project
☆22Updated 2 years ago
Related projects: ⓘ
- A detailed conversion of a C++ project to Python using pybind11☆18Updated 2 years ago
- Installing and Test PyTorch C++ API on Ubuntu with GPU enabled☆22Updated 8 months ago
- Fastest Random Distribution Generator for Eigen☆91Updated last week
- Some CUDA design patterns and a bit of template magic for CUDA☆144Updated last year
- Template for starting CUDA/C++ project using CMake with Github Action for CI☆29Updated last year
- Shared Pointer for Cuda Device Pointers and Cuda Streams, Smart Wrapper to Allocate and Deallocate Cuda Device Buffer.☆25Updated last year
- Exploring using stdpar and Cython☆32Updated 3 years ago
- A C++ header-only for data transfer between linear algebra libraries (Eigen, Armadillo, OpenCV, ArrayFire, LibTorch).☆79Updated 4 months ago
- Generate simple index ranges in C++ and CUDA C++☆38Updated last year
- High-Performance Computing: CPU Instructions, GPU OpenCL & CUDA, etc.☆14Updated 4 months ago
- Example of wrapping CGAL Delaunay triangulations and mesh refinement using pybind11☆42Updated 5 years ago
- Fully-working mlpack example programs☆116Updated last month
- Serial and parallel implementations of matrix multiplication☆34Updated 3 years ago
- Learn OpenMP examples step by step☆81Updated 3 years ago
- FastAD is a C++ implementation of automatic differentiation both forward and reverse mode.☆98Updated 11 months ago
- ☆21Updated 2 years ago
- An expression template based linear algebra library running completely on the GPU using CUDA☆21Updated 3 years ago
- Introduction to CUDA programming☆111Updated 7 years ago
- Fast, multithreaded, AVX/FMA matrix multiplication kernel in C++ 17☆17Updated 5 years ago
- CUDA kernel author's tools☆105Updated 2 years ago
- Study parallel programming - CUDA, OpenMP, MPI, Pthread☆54Updated 2 years ago
- Tutorial for wrapping C++ library into Python using pybind11 and CMake☆129Updated 8 months ago
- ☆56Updated 3 weeks ago
- Examples for using SYCL on CUDA☆59Updated 2 weeks ago
- C++ Matrix -- High performance and accurate (e.g. edge cases) matrix math library with expression template arithmetic operators☆113Updated 5 months ago
- Automatic Differentiation C++ Library☆56Updated 3 years ago
- Source code examples from the Parallel Forall Blog☆94Updated 5 years ago
- ☆20Updated 5 years ago
- QuantitativeBytes Linear Algebra Library [C++]. A simple implementation of various common linear algebra functions, intended for educatio…☆44Updated last year
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆22Updated 8 months ago