MagmaDNN / magmadnnLinks
MagmaDNN: a simple deep learning framework in c++
☆50Updated 5 years ago
Alternatives and similar repositories for magmadnn
Users that are interested in magmadnn are comparing it to the libraries listed below
Sorting:
- Subset of BLAS routines optimized for NVIDIA GPUs☆73Updated 2 years ago
- Next generation library for iterative sparse solvers for ROCm platform☆87Updated last week
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 4 years ago
- RAJA Performance Suite☆124Updated this week
- Zoltan Dynamic Load Balancing and Graph Algorithm Toolkit -- Distribution site☆39Updated 2 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆45Updated last year
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆55Updated 2 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆209Updated 5 months ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆60Updated last week
- ☆33Updated last month
- ☆80Updated this week
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆23Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆115Updated last week
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆129Updated 4 months ago
- Distributed View Extension for Kokkos☆48Updated 10 months ago
- DLA-Future☆79Updated last week
- ☆77Updated 2 weeks ago
- Compiler toolchain to enable generation of high-level DSLs for geophysical fluid dynamics models☆28Updated 2 years ago
- Examples for using SYCL on CUDA☆62Updated last month
- MGARD: MultiGrid Adaptive Reduction of Data☆41Updated last month
- Software to support people learning OpenMP with our book ... The OpenMP Common Core: Making OpenMP Simple Again☆83Updated last year
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 6 months ago
- MPI accelerator-integrated communication extensions☆37Updated 2 years ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆112Updated 2 years ago
- SYCL materials for ENCCS workshop☆25Updated 2 years ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆134Updated this week
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App☆38Updated 11 months ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆69Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆132Updated this week
- High-Performance Machine Learning Primitives☆12Updated 4 years ago