Linear algebra subroutines for large SSD-resident dense and sparse matrices
☆29Dec 14, 2020Updated 5 years ago
Alternatives and similar repositories for BLAS-on-flash
Users that are interested in BLAS-on-flash are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Apr 2, 2025Updated last year
- Companion source code for GTC 2014 talk☆11Mar 25, 2014Updated 12 years ago
- BlueDBM hw/sw implementation using the bluespecpcie PCIe library☆12Dec 25, 2022Updated 3 years ago
- iMLBench is a machine learning benchmark suite targeting CPU-GPU integrated architectures.☆11May 29, 2021Updated 5 years ago
- DPDK-based UDP echo server☆13Mar 27, 2016Updated 10 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Tools for MPI programmers☆14Sep 21, 2020Updated 5 years ago
- Unifies OS page cache for heterogeneous systems☆13Jul 26, 2019Updated 6 years ago
- C++ Header-Only Library for High-Performance Tensor-Vector Multiplication☆24Nov 2, 2025Updated 7 months ago
- A toy ML-like programming language☆16Sep 2, 2012Updated 13 years ago
- A Lisp syntax for Haskell.☆22May 11, 2012Updated 14 years ago
- 🦅 VSCode extension for F* with IDE features☆16Mar 21, 2020Updated 6 years ago
- Tool to detect and report leaked MPI objects like MPI_Requests and MPI_Datatypes☆14Sep 17, 2014Updated 11 years ago
- The MPI parallel MD-Workbench simulates user activities.☆12Jun 23, 2019Updated 6 years ago
- MPI Library Memory Consumption Utilities☆19Apr 21, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Command-line JSON processor☆14Oct 23, 2019Updated 6 years ago
- Automatic parallelizer for C/C++ code☆15Nov 21, 2019Updated 6 years ago
- generic C++ containers; matrix, triangle matrix, crs sparse matrix, etc.☆12Mar 23, 2018Updated 8 years ago
- Examples of different methods to compose FaaS functions together☆10Jul 18, 2018Updated 7 years ago
- ☆23Aug 14, 2022Updated 3 years ago
- Extended docker build tool.☆16Jun 12, 2023Updated 2 years ago
- Python Inference Script(PyIS)☆19Aug 30, 2022Updated 3 years ago
- CPAM: Compressed Parallel Augmented Maps☆27Aug 18, 2025Updated 9 months ago
- A multi-dimensional view over a contiguous array of data.☆11Oct 22, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A generic and efficient SIMD implementation of MSB Radix Sort with separate key and payload datastreams that supports arbitrary key and p…☆15Jan 31, 2025Updated last year
- Finite Element Analysis Toolbox 3☆16May 28, 2026Updated 2 weeks ago
- QUIC based speed test app☆12Apr 24, 2021Updated 5 years ago
- node.js bindings for Azure Speech SDK☆15Mar 31, 2026Updated 2 months ago
- ☆14Oct 28, 2011Updated 14 years ago
- Code coverage processor for sbt☆32Feb 21, 2011Updated 15 years ago
- Cross-platform socketpair functionality☆17May 15, 2025Updated last year
- This is a read-only mirror of the CRAN R package repository. speedglm — Fitting Linear and Generalized Linear Models to Large Data Sets…☆10May 6, 2023Updated 3 years ago
- LLDP Fabric Info Parsing and DSC Resources used to configured Data Center Bridging - Check https://aka.ms/Validate-DCB for more informati…☆14Nov 28, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Finite Element Modeling Technology☆14May 24, 2024Updated 2 years ago
- A distributed heart rate monitor using Microsoft Band, Raspberry PI2, and Windows 10 UWP, Azure and Signal/R☆11Jul 15, 2015Updated 10 years ago
- [Starter project] web server & client. Fully C++/WebAssembly. Server runs on google cloud function. Client uses a C++ virtual dom.☆11Jun 10, 2019Updated 7 years ago
- Howard Hinnant's example short_alloc (stack-based allocator) that demonstrates allocators in a short and sweet manner☆11Oct 29, 2015Updated 10 years ago
- Imitation learning from multiple experts☆13Aug 29, 2022Updated 3 years ago
- for paper @ ASPLOS‘25’☆16Mar 27, 2025Updated last year
- Immutable/persistent functional data structures for C++11☆10Mar 20, 2019Updated 7 years ago