stillwater-sc / hpr-blasLinks
High-Performance Reproducible BLAS using posit arithmetic
☆12Updated 3 years ago
Alternatives and similar repositories for hpr-blas
Users that are interested in hpr-blas are comparing it to the libraries listed below
Sorting:
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆16Updated 3 years ago
- BLAS implementation for Intel FPGA☆78Updated 4 years ago
- Error-Free Transformations as building blocks for compensated algorithms☆15Updated 2 years ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple…☆37Updated 3 years ago
- Custom-Precision Floating-point numbers.☆36Updated 4 months ago
- TAPA is a dataflow HLS framework that features fast compilation, expressive programming model and generates high-frequency FPGA accelerat…☆19Updated 9 months ago
- cuASR: CUDA Algebra for Semirings☆35Updated 2 years ago
- FPGA acceleration of arbitrary precision floating point computations.☆40Updated 3 years ago
- A posit arithmetic emulator.☆53Updated 11 months ago
- The test suite for the Xyce Parallel Electronic Simulator☆4Updated 3 weeks ago
- SForum 2020 : "A Run-time Hardware Routing Implementation for CGRA Overlays" code and data.☆11Updated 4 years ago
- ☆21Updated 3 years ago
- ☆9Updated 3 years ago
- A Deep Learning Framework for the Posit Number System☆28Updated 9 months ago
- muSYCL, the SYCL musical!☆12Updated 9 months ago
- Round matrix elements to lower precision in MATLAB☆37Updated 2 years ago
- AI Accelerators-SC23-tutorial Repository☆11Updated last year
- A dynamic analysis tool to detect floating-point errors in HPC applications.☆34Updated 2 weeks ago
- ☆17Updated 2 weeks ago
- tokenizer and parser for circle projects☆11Updated 5 years ago
- A GPU performance prediction toolkit for CUDA programs☆16Updated 6 years ago
- Stencil with Optimized Dataflow Architecture☆16Updated last year
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot☆14Updated 5 years ago
- Custom BLAS and LAPACK Cross-Compilation Framework for RISC-V☆19Updated 5 years ago
- Accelerator simulation framework using nn_dataflow traces and energy, etc. post-processing☆7Updated 6 years ago
- Recursive LAPACK Collection☆42Updated 3 years ago
- This is a hardware implementation of exact multiply accumulator for 32-bit posit number with es=2☆16Updated 7 years ago
- A tool for debugging and assessing floating point precision and reproducibility.☆77Updated 4 months ago
- c++ posit implementation☆44Updated last year
- The Self-Organizing NUMbers. A number format that learns from data.☆10Updated 5 years ago