ChenhanYu / hmlpLinks
High-Performance Machine Learning Primitives
☆12Updated 4 years ago
Alternatives and similar repositories for hmlp
Users that are interested in hmlp are comparing it to the libraries listed below
Sorting:
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆45Updated last year
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆55Updated last month
- ☆32Updated last week
- H2Opus: a performance-oriented library for hierarchical matrices☆18Updated 2 years ago
- Structured PIC proxy app based on Cabana☆15Updated 2 months ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆31Updated 2 years ago
- MiniFE Finite Element Mini-Application☆35Updated last year
- Comb is a communication performance benchmarking tool.☆25Updated 2 years ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 4 months ago
- Error-Free Transformations as building blocks for compensated algorithms☆15Updated 2 years ago
- A Massively Parallel FFT Library for CPU/GPU☆56Updated 4 years ago
- ☆16Updated 4 years ago
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App☆37Updated 9 months ago
- ☆76Updated last month
- A parallel kernel-independent FMM library for particle and volume potentials☆55Updated 3 months ago
- sparse matrix pre-processing library☆83Updated last year
- H2 Matrix Package☆31Updated 2 years ago
- Software libraries that implement hierarchical matrices☆59Updated 2 months ago
- Molecular dynamics proxy application based on Kokkos☆34Updated last year
- Zoltan Dynamic Load Balancing and Graph Algorithm Toolkit -- Distribution site☆39Updated 2 years ago
- AMD optimized Sparse Linear Algebra library☆32Updated last month
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆23Updated 10 months ago
- A C++ library for computing large scale tensor contractions.☆38Updated 7 years ago
- Training examples for SYCL☆49Updated this week
- An Adaptive Pencil Decomposition Library for NVIDIA GPUs☆69Updated 2 weeks ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 4 years ago
- PLASMA is a software package for solving problems in dense linear algebra using OpenMP☆32Updated 2 weeks ago
- Fast gradient evaluation in C++ based on Expression Templates.☆102Updated last month
- Algebraic multigrid benchmark☆34Updated last year
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆40Updated last month