stillwater-sc / hpr-blas
High-Performance Reproducible BLAS using posit arithmetic
☆12 Updated 3 years ago
Alternatives and similar repositories for hpr-blas
Users who are interested in hpr-blas are comparing it to the libraries listed below.
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware ☆15 Updated 3 years ago
- BLAS implementation for Intel FPGA ☆78 Updated 5 years ago
- Custom-Precision Floating-point numbers. ☆41 Updated 2 weeks ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple… ☆37 Updated 3 weeks ago
- Error-Free Transformations as building blocks for compensated algorithms ☆15 Updated 2 years ago
- ☆19 Updated 3 weeks ago
- Loop Kernel Analysis and Performance Modeling Toolkit ☆96 Updated 9 months ago
- A tool for debugging and assessing floating point precision and reproducibility. ☆91 Updated 2 months ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot ☆14 Updated 6 years ago
- NPUEval is an LLM evaluation dataset written specifically to target AIE kernel code generation on RyzenAI hardware. ☆25 Updated 2 months ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri… ☆32 Updated this week
- A domain-specific language and compiler for image processing ☆77 Updated 4 years ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code ☆15 Updated 2 years ago
- ☆27 Updated last month
- muSYCL, the SYCL musical! ☆13 Updated last year
- ☆17 Updated 4 years ago
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains. ☆52 Updated last year
- NPBench - A Benchmarking Suite for High-Performance NumPy ☆91 Updated last month
- SST Macro Element Library ☆36 Updated 2 months ago
- Round matrix elements to lower precision in MATLAB ☆37 Updated 3 years ago
- Code generation tool to generate mathematical libraries ☆58 Updated 2 months ago
- Benchmark for measuring the performance of sparse and irregular memory access. ☆82 Updated 4 months ago
- Drop-in replacement for IEEE Float ☆42 Updated 5 years ago
- MLIR tools and dialect for GraphBLAS ☆18 Updated 3 years ago
- A GPU performance prediction toolkit for CUDA programs ☆18 Updated 6 years ago
- Recursive LAPACK Collection ☆44 Updated 3 years ago
- Data repository supplementing my blog post comparing hardware characteristics of CPUs, GPUs, and MICs ☆35 Updated 3 years ago
- ☆41 Updated 3 months ago
- Distributed Communication-Optimal LU-factorization Algorithm ☆12 Updated 4 years ago
- Custom BLAS and LAPACK Cross-Compilation Framework for RISC-V ☆18 Updated 5 years ago