stillwater-sc / hpr-blasLinks
High-Performance Reproducible BLAS using posit arithmetic
☆12Updated 3 years ago
Alternatives and similar repositories for hpr-blas
Users that are interested in hpr-blas are comparing it to the libraries listed below
Sorting:
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆15Updated 3 years ago
- BLAS implementation for Intel FPGA☆77Updated 4 years ago
- Custom-Precision Floating-point numbers.☆38Updated 9 months ago
- Error-Free Transformations as building blocks for compensated algorithms☆15Updated 2 years ago
- NPUEval is an LLM evaluation dataset written specifically to target AIE kernel code generation on RyzenAI hardware.☆24Updated 2 months ago
- ☆19Updated last month
- A tool for debugging and assessing floating point precision and reproducibility.☆88Updated 3 weeks ago
- Implementation of AMD HIP for CPUs☆22Updated 5 years ago
- A GPU performance prediction toolkit for CUDA programs☆18Updated 6 years ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot☆14Updated 6 years ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple…☆37Updated 4 years ago
- cuASR: CUDA Algebra for Semirings☆41Updated 3 years ago
- Recursive LAPACK Collection☆44Updated 3 years ago
- muSYCL, the SYCL musical!☆12Updated last year
- Loop Kernel Analysis and Performance Modeling Toolkit☆96Updated 7 months ago
- ☆28Updated last month
- A dynamic analysis tool to detect floating-point errors in HPC applications.☆36Updated 2 weeks ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆80Updated 2 months ago
- Fork of llvm/llvm-project for f18. In sync with f18-mlir and f18.☆28Updated 2 years ago
- MLIR tools and dialect for GraphBLAS☆18Updated 3 years ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…☆31Updated this week
- AI Accelerators-SC23-tutorial Repository☆11Updated last year
- Data repository supplementing my blog post comparing hardware characteristics of CPUs, GPUs, and MICs☆35Updated 3 years ago
- ☆11Updated 9 years ago
- Next generation library for iterative sparse solvers for ROCm platform☆89Updated last week
- ☆18Updated last year
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆52Updated last year
- Sympiler is a Code Generator for Transforming Sparse Matrix Codes☆43Updated 2 years ago
- ExBLAS: fast, accurate, and reproducible BLAS☆13Updated 4 years ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 4 years ago