stillwater-sc / hpr-blasLinks
High-Performance Reproducible BLAS using posit arithmetic
☆12Updated 3 years ago
Alternatives and similar repositories for hpr-blas
Users that are interested in hpr-blas are comparing it to the libraries listed below
Sorting:
- BLAS implementation for Intel FPGA☆78Updated 5 years ago
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆15Updated 3 years ago
- Error-Free Transformations as building blocks for compensated algorithms☆16Updated 2 years ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple…☆37Updated last month
- NPUEval is an LLM evaluation dataset written specifically to target AIE kernel code generation on RyzenAI hardware.☆26Updated 2 months ago
- Custom-Precision Floating-point numbers.☆41Updated last month
- Loop Kernel Analysis and Performance Modeling Toolkit☆96Updated 10 months ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot☆14Updated 6 years ago
- muSYCL, the SYCL musical!☆13Updated last year
- MLIR tools and dialect for GraphBLAS☆18Updated 3 years ago
- A tool for debugging and assessing floating point precision and reproducibility.☆92Updated this week
- ☆21Updated last week
- NPBench - A Benchmarking Suite for High-Performance NumPy☆91Updated last week
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆51Updated last year
- A unified framework across multiple programming platforms☆42Updated 8 months ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆82Updated 5 months ago
- associative floating point addition☆19Updated last year
- ExBLAS: fast, accurate, and reproducible BLAS☆15Updated 4 years ago
- A domain-specific language and compiler for image processing☆77Updated 4 years ago
- ☆29Updated 2 months ago
- ☆11Updated 9 years ago
- Library for exact linear algebra, a C++ template-library based originally on LinBox intended for F4-like implementations☆18Updated 13 years ago
- Implementation of AMD HIP for CPUs☆22Updated 5 years ago
- Round matrix elements to lower precision in MATLAB☆38Updated 3 years ago
- Code generation tool to generate mathematical libraries☆58Updated 3 months ago
- Data Dependence Analyzer in the Polyhedral Model☆21Updated 2 years ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆71Updated 10 months ago
- AI Accelerators-SC23-tutorial Repository☆11Updated 2 years ago
- ☆17Updated 4 years ago
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM☆124Updated last year