frozenca / Ndim-Matrix
C++20 N-dimensional Matrix class for hobby project
☆23 · Updated 3 years ago
Alternatives and similar repositories for Ndim-Matrix:
Users interested in Ndim-Matrix are comparing it to the libraries listed below:
- Some CUDA design patterns and a bit of template magic for CUDA ☆147 · Updated last year
- Tutorial for wrapping C++ library into Python using pybind11 and CMake ☆137 · Updated last year
- A C++ version of the popular Python computer vision library imutils ☆44 · Updated last year
- ☆58 · Updated 4 months ago
- A detailed conversion of a C++ project to Python using pybind11 ☆18 · Updated 3 years ago
- Fastest Random Distribution Generator for Eigen ☆97 · Updated 4 months ago
- CUDA kernel author's tools ☆110 · Updated 2 years ago
- Fast and full-featured Matrix Market I/O library for C++, Python, and R ☆77 · Updated 5 months ago
- Template for starting CUDA/C++ project using CMake with Github Action for CI ☆29 · Updated last year
- Parallel Tasking Library (PTL) - Lightweight C++11 multithreading tasking system featuring thread-pool, task-groups, and lock-free task q… ☆43 · Updated 2 months ago
- A highly optimised C++ library for mathematical applications and neural networks. ☆166 · Updated 2 months ago
- Installing and testing the PyTorch C++ API on Ubuntu with GPU enabled ☆23 · Updated last year
- QuantitativeBytes Linear Algebra Library [C++]. A simple implementation of various common linear algebra functions, intended for educatio… ☆45 · Updated last year
- Reference Implementation for stdBLAS ☆131 · Updated last week
- High-Performance Computing: CPU Instructions, GPU OpenCL & CUDA, etc. ☆14 · Updated 8 months ago
- An expression template based linear algebra library running completely on the GPU using CUDA ☆24 · Updated 3 years ago
- Abstractions of memory, allocator, vector, tuple, shared_ptr, unique_ptr, bitset, variant and string working on both CPU and GPU ☆31 · Updated 2 weeks ago
- Fast, multithreaded, AVX/FMA matrix multiplication kernel in C++ 17 ☆18 · Updated 6 years ago
- Exploring using stdpar and Cython ☆33 · Updated 4 years ago
- 3D Tensors for Blaze (https://bitbucket.org/blaze-lib/blaze) ☆36 · Updated 4 years ago
- C++ Matrix -- High performance and accurate (e.g. edge cases) matrix math library with expression template arithmetic operators ☆116 · Updated 9 months ago
- Source code examples from the Parallel Forall Blog ☆95 · Updated 5 years ago
- Example of wrapping CGAL Delaunay triangulations and mesh refinement using pybind11 ☆43 · Updated 5 years ago
- Example of how to use CUDA with CMake >= 3.8 ☆69 · Updated last year
- Examples for using SYCL on CUDA ☆60 · Updated 2 weeks ago
- Parallel k-d tree with C++17, MPI and OpenMP ☆31 · Updated 2 years ago
- Light and self-contained implementation of C++17 parallel algorithms. ☆33 · Updated 2 months ago
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal - all it takes to sum a lot of numbers fast! ☆74 · Updated this week
- STL-like containers (array, vector, matrix, cube) useable in device code. ☆31 · Updated 10 months ago
- QRKit is an extension of Eigen C++ library that allows for efficient QR decomposition of sparse matrices with common sparsity patterns. ☆16 · Updated 4 years ago