LLNL / adapt-fp
☆18 Updated 3 years ago
Alternatives and similar repositories for adapt-fp
Users who are interested in adapt-fp are comparing it to the libraries listed below.
- Development repository for the Open Earth Compiler ☆82 Updated 4 years ago
- Programmable JIT Compilation and Optimization for C/C++ using LLVM ☆41 Updated this week
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆212 Updated this week
- Custom-Precision Floating-point numbers. ☆41 Updated this week
- A lightweight, Pythonic frontend for MLIR ☆81 Updated 2 years ago
- NPBench - A Benchmarking Suite for High-Performance NumPy ☆91 Updated last week
- ☆41 Updated 3 months ago
- ☆10 Updated 2 years ago
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/ ☆36 Updated 3 months ago
- A searchable Python interface to the SuiteSparse Matrix Collection ☆56 Updated 3 years ago
- Python wrapper for isl, an integer set library ☆83 Updated last week
- ☆26 Updated 8 months ago
- POC work on MLIR backend ☆61 Updated last year
- Error-Free Transformations as building blocks for compensated algorithms ☆16 Updated 2 years ago
- ☆20 Updated 6 years ago
- A web interface for the SuiteSparse Matrix Collection, formerly known as the University of Florida Sparse Matrix Collection ☆25 Updated 8 months ago
- Tensor Contraction Code Generator ☆39 Updated 8 years ago
- ytopt: machine-learning-based autotuning and hyperparameter optimization framework using Bayesian Optimization ☆49 Updated this week
- COCCL: Compression and precision co-aware collective communication library ☆30 Updated 10 months ago
- ☆103 Updated last week
- Pluto: An automatic polyhedral parallelizer and locality optimizer ☆321 Updated 5 months ago
- PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems ☆45 Updated 6 months ago
- GPU Performance Advisor ☆65 Updated 3 years ago
- An out-of-tree MLIR dialect template. ☆113 Updated last year
- ☆137 Updated 3 months ago
- DaCe - Data Centric Parallel Programming ☆573 Updated this week
- A dynamic analysis tool to detect floating-point errors in HPC applications. ☆39 Updated 3 weeks ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial ☆350 Updated 2 months ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear … ☆81 Updated 6 months ago
- ☆276 Updated this week