High-Performance Machine Learning Primitives
☆13 · Apr 17, 2021 · Updated 4 years ago
Alternatives and similar repositories for hmlp
Users that are interested in hmlp are comparing it to the libraries listed below.
- Software library RLCM (recursively low-rank compressed matrices) ☆14 · Apr 15, 2021 · Updated 4 years ago
- H2 Matrix Package ☆31 · Jul 18, 2023 · Updated 2 years ago
- hmglib - Hierarchical matrices on GPU(s) library ☆13 · Jul 31, 2018 · Updated 7 years ago
- Structured Matrix Package (LBNL) ☆191 · Mar 17, 2026 · Updated last week
- Software libraries that implement hierarchical matrices ☆62 · Jun 19, 2025 · Updated 9 months ago
- ☆59 · Feb 27, 2026 · Updated 3 weeks ago
- Flatiron Institute Fast Multipole Libraries --- This codebase is a set of libraries to compute N-body interactions governed by the Laplac… ☆142 · Mar 10, 2026 · Updated 2 weeks ago
- Integrated Interface for libraries of eigenvalue decomposition ☆10 · Nov 29, 2024 · Updated last year
- [SIGGRAPH 2025] Official Implementation of "Instant Self-Intersection Repair for 3D Meshes" ☆24 · Mar 17, 2026 · Updated last week
- A parallel kernel-independent FMM library for particle and volume potentials ☆59 · Feb 24, 2026 · Updated last month
- Arrow Matrix Decomposition - Communication-Efficient Distributed Sparse Matrix Multiplication ☆15 · Mar 25, 2024 · Updated last year
- A little library for using SIMD instructions for x86 and ARM, wrapping Agner Fog's vectorclass for x86 and filling some of its functional… ☆17 · Dec 10, 2021 · Updated 4 years ago
- Distributed-memory, double-precision, polar decomposition (QDWH/ZOLO-PD) of a dense matrix, svd (QDWH/ZOLOPD-SVD) of a dense matrix ☆15 · Jun 3, 2020 · Updated 5 years ago
- Catamount is a compute graph analysis tool to load, construct, and modify deep learning models and to symbolically analyze their compute … ☆14 · May 18, 2021 · Updated 4 years ago
- Sparse Matrix-Matrix Multiplication Benchmark on Intel Xeon and Xeon Phi (KNC, KNL) from blog post: ☆12 · Sep 25, 2016 · Updated 9 years ago
- ☆10 · Apr 24, 2023 · Updated 2 years ago
- FLOPS counter for all your GPU benchmarking needs ☆13 · Aug 8, 2024 · Updated last year
- Distributed memory, MPI based SuperLU ☆218 · Mar 17, 2026 · Updated last week
- C++ implementation of the algorithm in "Fast and Accurate Least-Mean-Squares Solvers", NIPS19 ☆11 · Mar 4, 2020 · Updated 6 years ago
- A suite of stochastic optimization methods for solving the empirical risk minimization problem. ☆17 · Nov 20, 2019 · Updated 6 years ago
- Version 1.2 ☆13 · Mar 15, 2017 · Updated 9 years ago
- Strassen's Algorithm for Tensor Contraction ☆15 · Jul 7, 2017 · Updated 8 years ago
- If you want to be brilliant, write down interesting things so you can recall them. ☆12 · Dec 18, 2022 · Updated 3 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms ☆12 · Jul 7, 2017 · Updated 8 years ago
- Paper: inexact GMRES with fast multipole method and low-p relaxation ☆11 · Aug 23, 2023 · Updated 2 years ago
- The shared memory version of the Alternating Directions Implicit Solver for Isogeometric Analysis ☆10 · Jan 26, 2019 · Updated 7 years ago
- CSR-based SpGEMM on nVidia and AMD GPUs ☆47 · Apr 9, 2016 · Updated 9 years ago
- A (not yet complete) F# Type Provider for Matlab in the spirit of the R Type Provider ☆25 · Dec 3, 2013 · Updated 12 years ago
- ☆17 · Apr 8, 2021 · Updated 4 years ago
- An example project with Poetry and scikit-build. ☆13 · Oct 29, 2024 · Updated last year
- Fast interpolative decompositions in Python ☆10 · Jan 4, 2021 · Updated 5 years ago
- Rust Optimal Transport solvers ☆12 · Mar 31, 2024 · Updated last year
- Course repository for Cornell CS 6210, Fall 2016 ☆18 · Nov 30, 2016 · Updated 9 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS. ☆50 · Mar 1, 2018 · Updated 8 years ago
- Python bindings for OpenSHMEM ☆26 · Mar 2, 2026 · Updated 3 weeks ago
- Code for "Disentangling images with Lie group transformations and sparse coding" (2023). ☆13 · May 24, 2021 · Updated 4 years ago
- DBCSR: Distributed Block Compressed Sparse Row matrix library ☆153 · Mar 17, 2026 · Updated last week
- CPC2018: Finals of the 2nd Domestic CPU Parallel Application Challenge ☆11 · Oct 26, 2018 · Updated 7 years ago
- A Haskell-embedded computer algebra system that knows nothing about algebra, at the core. ☆17 · Mar 16, 2024 · Updated 2 years ago