Attention in SRAM on Tenstorrent Grayskull
☆40 Jul 18, 2024 Updated last year
Alternatives and similar repositories for grayskull-attention
Users who are interested in grayskull-attention are comparing it to the libraries listed below.
- ☆15 Feb 7, 2026 Updated last month
- Tenstorrent MLIR compiler ☆250 Updated this week
- Simple experiments on Tenstorrent GraySkull e75 chip ☆13 Aug 28, 2024 Updated last year
- The TT-Forge ONNX is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their p… ☆54 Updated this week
- TT-Studio: An all-in-one platform to deploy and manage AI models optimized for Tenstorrent hardware with dedicated front-end demo applic… ☆40 Updated this week
- Tenstorrent Firmware repository ☆24 Feb 25, 2026 Updated last week
- User-Mode Driver for Tenstorrent hardware ☆38 Updated this week
- TVM for Tenstorrent ASICs ☆28 Sep 8, 2025 Updated 6 months ago
- Buda Compiler Backend for Tenstorrent devices ☆30 Apr 2, 2025 Updated 11 months ago
- [ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks ☆15 May 18, 2022 Updated 3 years ago
- My tests and experiments with some popular dl frameworks. ☆17 Sep 11, 2025 Updated 5 months ago
- Benchmark tests supporting the TiledCUDA library. ☆18 Nov 19, 2024 Updated last year
- ☆16 Sep 24, 2024 Updated last year
- Noisy language compiler ☆17 Jul 31, 2024 Updated last year
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust. ☆14 Nov 23, 2024 Updated last year
- tiny code to access tenstorrent blackhole ☆63 May 26, 2025 Updated 9 months ago
- Automatic virtualization of (general) accelerators. ☆47 Nov 28, 2022 Updated 3 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure. ☆19 May 12, 2024 Updated last year
- Tenstorrent Firmware Update Utility ☆10 Mar 2, 2026 Updated last week
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning ☆20 Jul 7, 2022 Updated 3 years ago
- ☆47 Updated this week
- End to End steps for adding custom ops in PyTorch. ☆24 Aug 20, 2020 Updated 5 years ago
- SYCL Reference Manual ☆30 Feb 11, 2026 Updated 3 weeks ago
- tenstorrent kernel from twitch ☆28 Mar 16, 2024 Updated last year
- Tenstorrent TT-BUDA Repository ☆314 Feb 9, 2026 Updated last month
- ☆42 Nov 1, 2025 Updated 4 months ago
- ☆24 Mar 26, 2023 Updated 2 years ago
- ☆29 Mar 18, 2025 Updated 11 months ago
- TT-NN operator library, and TT-Metalium low level kernel programming model. ☆1,374 Updated this week
- Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard! ☆215 Updated this week
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆31 Apr 2, 2025 Updated 11 months ago
- A Specification and a Library for Data Exchange in Polyhedral Compilation Tools ☆32 Jul 19, 2024 Updated last year
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator. ☆44 Oct 25, 2021 Updated 4 years ago
- SimplePIM is the first high-level programming framework for real-world processing-in-memory (PIM) architectures. Described in the PACT 20… ☆31 Oct 23, 2023 Updated 2 years ago
- ☆42 Mar 28, 2024 Updated last year
- Transformers components but in Triton ☆34 May 9, 2025 Updated 10 months ago
- Hop-Wise Graph Attention for Scalable and Generalizable Learning on Circuits ☆35 Aug 25, 2024 Updated last year
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer ☆30 Dec 6, 2023 Updated 2 years ago
- Re-implementation of the TASO compiler using equality saturation ☆138 Jun 28, 2021 Updated 4 years ago