stillwater-sc / hpr-blas
High-Performance Reproducible BLAS using posit arithmetic
☆12Updated 3 years ago
Alternatives and similar repositories for hpr-blas:
Users that are interested in hpr-blas are comparing it to the libraries listed below
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆16Updated 3 years ago
- BLAS implementation for Intel FPGA☆77Updated 4 years ago
- Custom-Precision Floating-point numbers.☆33Updated 2 months ago
- FPGA acceleration of arbitrary precision floating point computations.☆38Updated 2 years ago
- TAPA is a dataflow HLS framework that features fast compilation, expressive programming model and generates high-frequency FPGA accelerat…☆19Updated 6 months ago
- Error-Free Transformations as building blocks for compensated algorithms☆14Updated 2 years ago
- SForum 2020 : "A Run-time Hardware Routing Implementation for CGRA Overlays" code and data.☆11Updated 4 years ago
- muSYCL, the SYCL musical!☆12Updated 6 months ago
- A Deep Learning Framework for the Posit Number System☆27Updated 7 months ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple…☆36Updated 3 years ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/…☆18Updated last year
- AI Accelerators-SC23-tutorial Repository☆11Updated last year
- A polyhedral compiler for hardware accelerators☆56Updated 7 months ago
- ☆9Updated 2 years ago
- A GPU performance prediction toolkit for CUDA programs☆16Updated 5 years ago
- ☆20Updated 3 years ago
- Nanos6 is a runtime that implements the OmpSs-2 parallel programming model, developed by the System Tools and Advanced Runtimes (STAR) gr…☆20Updated 4 months ago
- Round matrix elements to lower precision in MATLAB☆36Updated 2 years ago
- cuASR: CUDA Algebra for Semirings☆35Updated 2 years ago
- A tool for debugging and assessing floating point precision and reproducibility.☆73Updated last month
- High-level synthesis Integer library☆9Updated 3 years ago
- Custom BLAS and LAPACK Cross-Compilation Framework for RISC-V☆19Updated 4 years ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 3 years ago
- ☆16Updated 3 years ago
- The test suite for the Xyce Parallel Electronic Simulator☆4Updated 2 weeks ago
- MagmaDNN: a simple deep learning framework in c++☆49Updated 4 years ago
- Data Dependence Analyzer in the Polyhedral Model☆20Updated last year
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.☆23Updated 3 months ago
- Accelerator simulation framework using nn_dataflow traces and energy, etc. post-processing☆7Updated 6 years ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot☆14Updated 5 years ago