stillwater-sc / hpr-blas
High-Performance Reproducible BLAS using posit arithmetic
☆12 Updated 3 years ago
Alternatives and similar repositories for hpr-blas
Users who are interested in hpr-blas are comparing it to the libraries listed below.
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware ☆15 Updated 3 years ago
- BLAS implementation for Intel FPGA ☆77 Updated 5 years ago
- Custom-Precision Floating-point numbers. ☆39 Updated last week
- Error-Free Transformations as building blocks for compensated algorithms ☆15 Updated 2 years ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot ☆14 Updated 6 years ago
- NPBench - A Benchmarking Suite for High-Performance NumPy ☆90 Updated last week
- A tool for debugging and assessing floating point precision and reproducibility. ☆90 Updated 2 months ago
- Loop Kernel Analysis and Performance Modeling Toolkit ☆96 Updated 9 months ago
- NPUEval is an LLM evaluation dataset written specifically to target AIE kernel code generation on RyzenAI hardware. ☆24 Updated last month
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple… ☆37 Updated last week
- Advanced Programming for Computer Design Problems ☆17 Updated 4 years ago
- C++ Header-Only Library for High-Performance Tensor-Vector Multiplication ☆23 Updated last month
- Library to plot integer sets and maps ☆53 Updated 9 years ago
- ☆19 Updated this week
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code ☆15 Updated 2 years ago
- A dynamic analysis tool to detect floating-point errors in HPC applications. ☆39 Updated 3 weeks ago
- A domain-specific language and compiler for image processing ☆77 Updated 4 years ago
- Benchmark for measuring the performance of sparse and irregular memory access. ☆82 Updated 4 months ago
- ExBLAS: fast, accurate, and reproducible BLAS ☆13 Updated 4 years ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri… ☆31 Updated this week
- AI Accelerators-SC23-tutorial Repository ☆11 Updated 2 years ago
- cuASR: CUDA Algebra for Semirings ☆42 Updated 3 years ago
- ☆11 Updated 9 years ago
- A unified framework across multiple programming platforms ☆42 Updated 6 months ago
- ☆41 Updated 2 months ago
- Recursive LAPACK Collection ☆44 Updated 3 years ago
- Implementation of AMD HIP for CPUs ☆22 Updated 5 years ago
- SST Macro Element Library ☆36 Updated last month
- Round matrix elements to lower precision in MATLAB ☆37 Updated 3 years ago
- ☆27 Updated 3 weeks ago