High-Performance Machine Learning Primitives
☆13Apr 17, 2021Updated 5 years ago
Alternatives and similar repositories for hmlp
Users who are interested in hmlp are comparing it to the libraries listed below.
- Software library RLCM (recursively low-rank compressed matrices)☆14Apr 15, 2021Updated 5 years ago
- Structured Matrix Package (LBNL)☆194Jun 6, 2026Updated last week
- Software libraries that implement hierarchical matrices☆62Jun 19, 2025Updated 11 months ago
- ☆62Jun 5, 2026Updated last week
- Flatiron Institute Fast Multipole Libraries --- This codebase is a set of libraries to compute N-body interactions governed by the Laplac…☆155Apr 2, 2026Updated 2 months ago
- Parallelized BBFMM3D with OpenMP☆32Oct 8, 2025Updated 8 months ago
- Integrated Interface for libraries of eigenvalue decomposition☆10Nov 29, 2024Updated last year
- ☆38Updated this week
- A parallel kernel-independent FMM library for particle and volume potentials☆62Updated this week
- A little library for using SIMD instructions for x86 and ARM, wrapping Agner Fog's vectorclass for x86 and filling some of its functional…☆17May 13, 2026Updated last month
- Distributed-memory, double-precision, polar decomposition (QDWH/ZOLO-PD) of a dense matrix, svd (QDWH/ZOLOPD-SVD) of a dense matrix☆14Jun 3, 2020Updated 6 years ago
- devector and batch_deque containers for C++. See more at: http://erenon.hu/double_ended☆16Oct 7, 2017Updated 8 years ago
- Catamount is a compute graph analysis tool to load, construct, and modify deep learning models and to symbolically analyze their compute …☆14May 18, 2021Updated 5 years ago
- ☆10Apr 24, 2023Updated 3 years ago
- [SIGGRAPH 2025] Official Implementation of "Instant Self-Intersection Repair for 3D Meshes"☆52Mar 26, 2026Updated 2 months ago
- Source code from "CUDA Fortran for Scientists and Engineers, Second Edition"☆21Mar 31, 2026Updated 2 months ago
- FLOPS counter for all your GPU benchmarking needs☆13Aug 8, 2024Updated last year
- Distributed memory, MPI based SuperLU☆220Jun 5, 2026Updated last week
- A suite of stochastic optimization methods for solving the empirical risk minimization problem.☆17Nov 20, 2019Updated 6 years ago
- C++ implementation of the algorithm in "Fast and Accurate Least-Mean-Squares Solvers", NIPS19☆11Mar 4, 2020Updated 6 years ago
- Version 1.2☆13Mar 15, 2017Updated 9 years ago
- Fast linear algebra in MATLAB☆72May 8, 2023Updated 3 years ago
- Strassen's Algorithm for Tensor Contraction☆15Jul 7, 2017Updated 8 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Jul 7, 2017Updated 8 years ago
- CSR-based SpGEMM on NVIDIA and AMD GPUs☆48Apr 9, 2016Updated 10 years ago
- The shared memory version of the Alternating Directions Implicit Solver for Isogeometric Analysis☆10Jan 26, 2019Updated 7 years ago
- A (not yet complete) F# Type Provider for Matlab in the spirit of the R Type Provider☆25Dec 3, 2013Updated 12 years ago
- ☆17Apr 8, 2021Updated 5 years ago
- ☆10Jun 4, 2026Updated last week
- Paper: inexact GMRES with fast multipole method and low-p relaxation☆11Aug 23, 2023Updated 2 years ago
- ☆11Jun 11, 2020Updated 6 years ago
- Web frontend for Myria☆12Sep 30, 2020Updated 5 years ago
- Parallel implementation of k-means clustering using MPI4PY and PyCUDA.☆10Mar 11, 2019Updated 7 years ago
- BLAS OpenCL implementation.☆17Apr 8, 2015Updated 11 years ago
- Course repository for Cornell CS 6210, Fall 2016☆18Nov 30, 2016Updated 9 years ago
- Rust Optimal Transport solvers☆13Mar 31, 2024Updated 2 years ago
- A Modern Fortran library for fast, approximate math functions☆17Jan 22, 2023Updated 3 years ago
- Python bindings for OpenSHMEM☆27May 11, 2026Updated last month
- Code for "Disentangling images with Lie group transformations and sparse coding" (2023).☆13May 24, 2021Updated 5 years ago