stillwater-sc / hpr-blasLinks
High-Performance Reproducible BLAS using posit arithmetic
☆12Updated 3 years ago
Alternatives and similar repositories for hpr-blas
Users that are interested in hpr-blas are comparing it to the libraries listed below
Sorting:
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆15Updated 3 years ago
- BLAS implementation for Intel FPGA☆77Updated 5 years ago
- Custom-Precision Floating-point numbers.☆38Updated 10 months ago
- Error-Free Transformations as building blocks for compensated algorithms☆15Updated 2 years ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple…☆37Updated 4 years ago
- muSYCL, the SYCL musical!☆12Updated last year
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot☆14Updated 6 years ago
- MLIR tools and dialect for GraphBLAS☆18Updated 3 years ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…☆31Updated this week
- A domain-specific language and compiler for image processing☆77Updated 4 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆96Updated 8 months ago
- NPUEval is an LLM evaluation dataset written specifically to target AIE kernel code generation on RyzenAI hardware.☆24Updated 3 weeks ago
- A tool for debugging and assessing floating point precision and reproducibility.☆90Updated last month
- A GPU performance prediction toolkit for CUDA programs☆18Updated 6 years ago
- Recursive LAPACK Collection☆44Updated 3 years ago
- ☆19Updated 2 weeks ago
- ExBLAS: fast, accurate, and reproducible BLAS☆13Updated 4 years ago
- ☆11Updated 9 years ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆90Updated last month
- ☆28Updated 2 months ago
- ☆17Updated 4 years ago
- A unified framework across multiple programming platforms☆42Updated 6 months ago
- Data Dependence Analyzer in the Polyhedral Model☆21Updated 2 years ago
- cuASR: CUDA Algebra for Semirings☆42Updated 3 years ago
- C++ Header-Only Library for High-Performance Tensor-Vector Multiplication☆23Updated 3 weeks ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆70Updated 8 months ago
- Implementation of AMD HIP for CPUs☆22Updated 5 years ago
- Library to plot integer sets and maps☆53Updated 9 years ago
- Data repository supplementing my blog post comparing hardware characteristics of CPUs, GPUs, and MICs☆35Updated 3 years ago
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures☆56Updated 2 years ago