ChenhanYu / hmlpLinks
High-Performance Machine Learning Primitives
☆12Updated 4 years ago
Alternatives and similar repositories for hmlp
Users that are interested in hmlp are comparing it to the libraries listed below
Sorting:
- ☆34Updated 2 months ago
- sparse matrix pre-processing library☆83Updated last year
- Software libraries that implement hierarchical matrices☆60Updated 5 months ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆33Updated 2 years ago
- ☆16Updated 4 years ago
- H2 Matrix Package☆31Updated 2 years ago
- A Massively Parallel FFT Library for CPU/GPU☆58Updated 5 years ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated last week
- H2Opus: a performance-oriented library for hierarchical matrices☆19Updated 3 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆45Updated last year
- Structured Matrix Package (LBNL)☆181Updated last month
- ☆57Updated 2 weeks ago
- Tensor Contraction Code Generator☆39Updated 8 years ago
- Molecular dynamics proxy application based on Kokkos☆33Updated last year
- ☆11Updated 4 years ago
- A parallel kernel-independent FMM library for particle and volume potentials☆57Updated last week
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi…☆27Updated last year
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 3 years ago
- AMD optimized Sparse Linear Algebra library☆34Updated last week
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App☆38Updated last year
- Comb is a communication performance benchmarking tool.☆25Updated 2 years ago
- An Adaptive Pencil Decomposition Library for NVIDIA GPUs☆72Updated this week
- H2Lib public repository☆61Updated 3 years ago
- cuASR: CUDA Algebra for Semirings☆42Updated 3 years ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆130Updated last month
- Fast gradient evaluation in C++ based on Expression Templates.☆104Updated 2 weeks ago
- Error-Free Transformations as building blocks for compensated algorithms☆15Updated 2 years ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆79Updated 3 months ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆53Updated 3 months ago
- ☆83Updated 2 weeks ago