north-numerical-computing / cpfloat
Custom-Precision Floating-point numbers.
☆33 Updated 2 months ago
Alternatives and similar repositories for cpfloat:
Users interested in cpfloat are comparing it to the libraries listed below.
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆30 Updated 3 months ago
- ☆30 Updated 2 years ago
- cuASR: CUDA Algebra for Semirings ☆35 Updated 2 years ago
- Round matrix elements to lower precision in MATLAB ☆36 Updated 2 years ago
- development repository for the open earth compiler ☆79 Updated 4 years ago
- Error-Free Transformations as building blocks for compensated algorithms ☆14 Updated 2 years ago
- A web interface for the SuiteSparse Matrix Collection, formerly known as the University of Florida Sparse Matrix Collection ☆23 Updated 3 months ago
- Distributed Communication-Optimal LU-factorization Algorithm ☆12 Updated 3 years ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data ☆31 Updated 4 months ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/… ☆18 Updated last year
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri… ☆25 Updated last month
- ☆40 Updated last week
- Next generation library for iterative sparse solvers for ROCm platform ☆78 Updated last week
- A dynamic analysis tool to detect floating-point errors in HPC applications. ☆33 Updated 2 years ago
- Data-Centric MLIR dialect ☆40 Updated last year
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear … ☆71 Updated last month
- Benchmark for measuring the performance of sparse and irregular memory access. ☆77 Updated last month
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri… ☆20 Updated 2 years ago
- A tool for debugging and assessing floating point precision and reproducibility. ☆73 Updated last month
- AI Accelerators-SC23-tutorial Repository ☆11 Updated last year
- ☆14 Updated last year
- The SCMC and PSCMC programming language ☆18 Updated last year
- ☆17 Updated 5 years ago
- BLAS implementation for Intel FPGA ☆77 Updated 4 years ago
- ☆43 Updated 4 years ago
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details ☆24 Updated 5 years ago
- High-Performance Reproducible BLAS using posit arithmetic ☆12 Updated 3 years ago
- pLiner is a framework that helps programmers identify locations in the source of numerical code that are highly affected by compiler opti… ☆17 Updated last year
- MagmaDNN: a simple deep learning framework in c++ ☆49 Updated 4 years ago