ChenhanYu / hmlpLinks
High-Performance Machine Learning Primitives
☆12Updated 4 years ago
Alternatives and similar repositories for hmlp
Users that are interested in hmlp are comparing it to the libraries listed below
Sorting:
- ☆34Updated 2 months ago
- A parallel kernel-independent FMM library for particle and volume potentials☆58Updated last month
- H2 Matrix Package☆31Updated 2 years ago
- A Massively Parallel FFT Library for CPU/GPU☆58Updated 5 years ago
- H2Opus: a performance-oriented library for hierarchical matrices☆19Updated 3 years ago
- Structured Matrix Package (LBNL)☆182Updated 2 months ago
- sparse matrix pre-processing library☆83Updated last year
- Software libraries that implement hierarchical matrices☆61Updated 5 months ago
- Fast gradient evaluation in C++ based on Expression Templates.☆106Updated last week
- HiCMA: Hierarchical Computations on Manycore Architectures☆34Updated 2 years ago
- ☆57Updated last week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated last month
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆46Updated last year
- The parGeMSLR is an MPI-based sparse linear system solution/preconditioning package implementation with C++.☆25Updated 3 months ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆131Updated last month
- AMD optimized Sparse Linear Algebra library☆34Updated last month
- Zoltan Dynamic Load Balancing and Graph Algorithm Toolkit -- Distribution site☆40Updated 2 years ago
- A C++ library for computing large scale tensor contractions.☆38Updated 7 years ago
- Molecular dynamics proxy application based on Kokkos☆33Updated last year
- PLASMA is a software package for solving problems in dense linear algebra using OpenMP☆35Updated 4 months ago
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App☆38Updated last year
- Structured PIC proxy app based on Cabana☆15Updated 5 months ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆53Updated 4 months ago
- MiniFE Finite Element Mini-Application☆37Updated last year
- An Adaptive Pencil Decomposition Library for NVIDIA GPUs☆72Updated 2 weeks ago
- Comb is a communication performance benchmarking tool.☆25Updated 2 years ago
- directory for randomized cholesky☆18Updated 2 months ago
- Next generation library for iterative sparse solvers for ROCm platform☆89Updated this week
- GPU Eigensolver for symmetric/hermitian matrices.☆67Updated 4 years ago
- A fast shared & distributed memory task-based runtime in C++☆28Updated 4 years ago