vatai / tadashiLinks
A library for code transformations with guaranteed legality
☆15Updated this week
Alternatives and similar repositories for tadashi
Users that are interested in tadashi are comparing it to the libraries listed below
Sorting:
- Custom-Precision Floating-point numbers.☆38Updated 8 months ago
- An HPL-AI implementation for Fugaku☆21Updated 4 years ago
- A dynamic analysis tool to detect floating-point errors in HPC applications.☆36Updated 2 weeks ago
- ☆18Updated 5 years ago
- cuASR: CUDA Algebra for Semirings☆38Updated 3 years ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆34Updated 3 weeks ago
- A unified framework across multiple programming platforms☆41Updated 3 months ago
- ☆40Updated this week
- MLIR tools and dialect for GraphBLAS☆18Updated 3 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Updated 5 months ago
- FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme☆86Updated 5 months ago
- Data-Centric MLIR dialect☆43Updated last year
- A Data-Centric Compiler for Machine Learning☆84Updated last year
- ☆76Updated last month
- ☆12Updated 4 years ago
- ☆18Updated last year
- development repository for the open earth compiler☆80Updated 4 years ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 4 years ago
- Sparsity support for PyTorch☆37Updated 5 months ago
- ☆74Updated last week
- Next generation library for iterative sparse solvers for ROCm platform☆85Updated this week
- Error-Free Transformations as building blocks for compensated algorithms☆15Updated 2 years ago
- ☆25Updated last week
- CUDA Dynamic Memory Allocator for SOA Data Layout☆38Updated 3 years ago
- Data Dependence Analyzer in the Polyhedral Model☆21Updated last year
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆59Updated this week
- Programmable JIT Compilation and Optimization for C/C++ using LLVM☆29Updated this week
- MagmaDNN: a simple deep learning framework in c++☆50Updated 5 years ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆61Updated last week
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆15Updated 2 years ago