stillwater-sc / hpr-blasLinks
High-Performance Reproducible BLAS using posit arithmetic
☆12Updated 3 years ago
Alternatives and similar repositories for hpr-blas
Users that are interested in hpr-blas are comparing it to the libraries listed below
Sorting:
- BLAS implementation for Intel FPGA☆77Updated 4 years ago
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆16Updated 3 years ago
- Custom-Precision Floating-point numbers.☆37Updated 6 months ago
- A pseudo random number generator library written against the SYCL API.☆12Updated 6 years ago
- ☆19Updated last week
- Error-Free Transformations as building blocks for compensated algorithms☆15Updated 2 years ago
- SForum 2020 : "A Run-time Hardware Routing Implementation for CGRA Overlays" code and data.☆11Updated 4 years ago
- A tool for debugging and assessing floating point precision and reproducibility.☆84Updated 2 weeks ago
- Library for exact linear algebra, a C++ template-library based originally on LinBox intended for F4-like implementations☆18Updated 12 years ago
- A GPU performance prediction toolkit for CUDA programs☆17Updated 6 years ago
- FFTX Project☆25Updated this week
- MLIR tools and dialect for GraphBLAS☆18Updated 3 years ago
- TAPA is a dataflow HLS framework that features fast compilation, expressive programming model and generates high-frequency FPGA accelerat…☆19Updated 10 months ago
- Geant4 EM physics simulation R&D project looking for solutions to reduce the computing performance bottleneck experienced by HEP detector…☆12Updated last month
- Advanced Programming for Computer Design Problems☆18Updated 3 years ago
- cuASR: CUDA Algebra for Semirings☆36Updated 2 years ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆15Updated 2 years ago
- C++ Header-Only Library for High-Performance Tensor-Vector Multiplication☆21Updated 7 months ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 3 years ago
- ☆21Updated 3 years ago
- tokenizer and parser for circle projects☆11Updated 5 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆94Updated 3 months ago
- Data repository supplementing my blog post comparing hardware characteristics of CPUs, GPUs, and MICs☆35Updated 3 years ago
- ExBLAS: fast, accurate, and reproducible BLAS☆13Updated 3 years ago
- MATLAB Code for Parameters of Floating-Point Arithmetics☆8Updated 3 years ago
- Custom BLAS and LAPACK Cross-Compilation Framework for RISC-V☆19Updated 5 years ago
- The Self-Organizing NUMbers. A number format that learns from data.☆10Updated 5 years ago
- GPTPU for SC 2021☆52Updated 2 years ago
- Polyhedral Compilation tool for High Level Synthesis.☆10Updated 11 years ago
- Teaching materials, slides and exercises, for the GPU & CUDA training in 2017☆13Updated 8 years ago