Attention in SRAM on Tenstorrent Grayskull
☆40 · Jul 18, 2024 · Updated last year
Alternatives and similar repositories for grayskull-attention
Users that are interested in grayskull-attention are comparing it to the libraries listed below
- ☆15 · Feb 7, 2026 · Updated last month
- Tenstorrent MLIR compiler ☆250 · Updated this week
- TT-Studio : An all-in-one platform to deploy and manage AI models optimized for Tenstorrent hardware with dedicated front-end demo applic… ☆40 · Updated this week
- Tenstorrent Firmware repository ☆24 · Feb 25, 2026 · Updated last week
- TVM for Tenstorrent ASICs ☆28 · Sep 8, 2025 · Updated 6 months ago
- Buda Compiler Backend for Tenstorrent devices ☆30 · Apr 2, 2025 · Updated 11 months ago
- [ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks ☆15 · May 18, 2022 · Updated 3 years ago
- My tests and experiments with some popular dl frameworks. ☆17 · Sep 11, 2025 · Updated 5 months ago
- Benchmark tests supporting the TiledCUDA library. ☆18 · Nov 19, 2024 · Updated last year
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust. ☆14 · Nov 23, 2024 · Updated last year
- ☆16 · Sep 24, 2024 · Updated last year
- Noisy language compiler ☆17 · Jul 31, 2024 · Updated last year
- Automatic virtualization of (general) accelerators. ☆47 · Nov 28, 2022 · Updated 3 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure. ☆19 · May 12, 2024 · Updated last year
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning ☆20 · Jul 7, 2022 · Updated 3 years ago
- A collection of trading settings for the Galileo FX trading robot. These settings are designed to optimize trading strategies across vari… ☆13 · Jan 27, 2025 · Updated last year
- ☆47 · Updated this week
- End to End steps for adding custom ops in PyTorch. ☆24 · Aug 20, 2020 · Updated 5 years ago
- tenstorrent kernel from twitch ☆28 · Mar 16, 2024 · Updated last year
- Tenstorrent TT-BUDA Repository ☆314 · Feb 9, 2026 · Updated last month
- ☆24 · Mar 26, 2023 · Updated 2 years ago
- ☆42 · Nov 1, 2025 · Updated 4 months ago
- TT-NN operator library, and TT-Metalium low level kernel programming model. ☆1,374 · Updated this week
- Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard! ☆215 · Updated this week
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆32 · Apr 2, 2025 · Updated 11 months ago
- Sample programs for the LLVM PTX back-end ☆41 · Aug 27, 2015 · Updated 10 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator. ☆44 · Oct 25, 2021 · Updated 4 years ago
- Tenstorrent system interface library ☆34 · Feb 18, 2026 · Updated 2 weeks ago
- SimplePIM is the first high-level programming framework for real-world processing-in-memory (PIM) architectures. Described in the PACT 20… ☆31 · Oct 23, 2023 · Updated 2 years ago
- Hop-Wise Graph Attention for Scalable and Generalizable Learning on Circuits ☆35 · Aug 25, 2024 · Updated last year
- Transformers components but in Triton ☆34 · May 9, 2025 · Updated 10 months ago
- ☆33 · Mar 6, 2023 · Updated 3 years ago
- ☆10 · Apr 21, 2024 · Updated last year
- A Real time LiDAR-Visual-Inertial object level semantic SLAM for Forest Environments ☆13 · Dec 2, 2024 · Updated last year
- A novel approach to detecting metallic objects on a moving target using Wi-Fi radios and deep learning. ☆11 · Jan 16, 2019 · Updated 7 years ago
- BERT Sentiment Classification on the IMDb Large Movie Review Dataset. ☆16 · Sep 8, 2022 · Updated 3 years ago
- ☆14 · Apr 14, 2025 · Updated 10 months ago
- A novel, highly optimized CUDA implementation of the k-means algorithm. ☆42 · Mar 3, 2022 · Updated 4 years ago
- MATLAB function to fill an area with hatching ~~or speckling~~ ☆11 · Mar 4, 2018 · Updated 8 years ago