Linear algebra subroutines for large SSD-resident dense and sparse matrices
☆29Dec 14, 2020Updated 5 years ago
Alternatives and similar repositories for BLAS-on-flash
Users that are interested in BLAS-on-flash are comparing it to the libraries listed below
Sorting:
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆31Apr 2, 2025Updated 11 months ago
- Interactive Theorem Proving course using HOL4☆13Jun 21, 2023Updated 2 years ago
- LaunchMON is a software infrastructure that enables HPC run-time tools to co-locate tool daemons with a parallel job. Its API allows a to…☆13Feb 11, 2026Updated last month
- Companion source code for GTC 2014 talk☆11Mar 25, 2014Updated 11 years ago
- BlueDBM hw/sw implementation using the bluespecpcie PCIe library☆12Dec 25, 2022Updated 3 years ago
- iMLBench is a machine learning benchmark suite targeting CPU-GPU integrated architectures.☆11May 29, 2021Updated 4 years ago
- Hybrid BFS on Xilinx Zynq☆18Jun 9, 2015Updated 10 years ago
- ☆10May 4, 2023Updated 2 years ago
- ☆16Jan 5, 2022Updated 4 years ago
- Unifies OS page cache for heterogeneous systems☆12Jul 26, 2019Updated 6 years ago
- C++ Header-Only Library for High-Performance Tensor-Vector Multiplication☆23Nov 2, 2025Updated 4 months ago
- Tool to detect and report leaked MPI objects like MPI_Requests and MPI_Datatypes☆14Sep 17, 2014Updated 11 years ago
- The MPI parallel MD-Workbench simulates user activities.☆12Jun 23, 2019Updated 6 years ago
- Simple persistent storage for C++ objects using virtual memory mapping mechanism☆18Nov 23, 2009Updated 16 years ago
- Command-line JSON processor☆14Oct 23, 2019Updated 6 years ago
- Interpolation in 2D and 3D, object-oriented interfaces☆16Jul 5, 2022Updated 3 years ago
- Deft: A Scalable Tree Index for Disaggregated Memory☆23Apr 23, 2025Updated 10 months ago
- Automatic parallelizer for C/C++ code☆15Nov 21, 2019Updated 6 years ago
- generic C++ containers; matrix, triangle matrix, crs sparse matrix, etc.☆12Mar 23, 2018Updated 7 years ago
- Samples for partner application development (OEM, MO, IHV) for Window☆18Jun 12, 2023Updated 2 years ago
- Tunnel is a clean wrapper around native Go channel to allow cleanly closing the channel without throwing a panic.☆13Aug 1, 2019Updated 6 years ago
- Go Version of Redis on PMEM☆12Dec 20, 2021Updated 4 years ago
- A fast text search engine built for SSDs, written in C++.☆11Aug 29, 2022Updated 3 years ago
- Examples of different methods to compose FaaS functions together☆10Jul 18, 2018Updated 7 years ago
- Materials to teach terminal fundamentals for HPC users☆19Aug 18, 2021Updated 4 years ago
- Python Inference Script(PyIS)☆19Aug 30, 2022Updated 3 years ago
- Research simulation toolkit for federated learning☆13Nov 7, 2020Updated 5 years ago
- CPAM: Compressed Parallel Augmented Maps☆27Aug 18, 2025Updated 7 months ago
- A multi-dimensional view over a contiguous array of data.☆11Oct 22, 2019Updated 6 years ago
- Whippletree, a novel approach to scheduling dynamic, irregular workloads on the GPU☆22Nov 24, 2015Updated 10 years ago
- Finite Element Analysis Toolbox 3☆16Feb 3, 2026Updated last month
- QUIC based speed test app☆12Apr 24, 2021Updated 4 years ago
- A wrapper arround mpiexec, gdbserver, and gdb that makes debugging MPI programs eaiser with a moderate number of processes.☆35Oct 13, 2025Updated 5 months ago
- ☆14Oct 28, 2011Updated 14 years ago
- Cross-platform socketpair functionality☆16May 15, 2025Updated 10 months ago
- Finite Element Modeling Technology☆13May 24, 2024Updated last year
- LLDP Fabric Info Parsing and DSC Resources used to configured Data Center Bridging - Check https://aka.ms/Validate-DCB for more informati…☆15Nov 28, 2022Updated 3 years ago
- This is a read-only mirror of the CRAN R package repository. speedglm — Fitting Linear and Generalized Linear Models to Large Data Sets…☆10May 6, 2023Updated 2 years ago
- A distributed heart rate monitor using Microsoft Band, Raspberry PI2, and Windows 10 UWP, Azure and Signal/R☆11Jul 15, 2015Updated 10 years ago