moritztng / grayskull-attentionLinks
Attention in SRAM on Tenstorrent Grayskull
☆38Updated last year
Alternatives and similar repositories for grayskull-attention
Users that are interested in grayskull-attention are comparing it to the libraries listed below
Sorting:
- High-Performance SGEMM on CUDA devices☆110Updated 9 months ago
- Tenstorrent MLIR compiler☆211Updated this week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆47Updated 2 months ago
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆100Updated 4 months ago
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…☆51Updated this week
- Custom PTX Instruction Benchmark☆132Updated 8 months ago
- Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard!☆140Updated this week