Attention in SRAM on Tenstorrent Grayskull
☆38 · Jul 18, 2024 · Updated last year
Alternatives and similar repositories for grayskull-attention
Users that are interested in grayskull-attention are comparing it to the libraries listed below.
- Simple experiments on Tenstorrent GraySkull e75 chip ☆14 · Aug 28, 2024 · Updated last year
- Tenstorrent MLIR compiler ☆280 · Updated this week
- TVM for Tenstorrent ASICs ☆31 · Apr 29, 2026 · Updated last month
- The Tenstorrent Studio (TT-Studio) is an easy to use web interface for running AI models on Tenstorrent hardware. It handles all the tech… ☆48 · Updated this week
- Tenstorrent Firmware repository ☆24 · Feb 25, 2026 · Updated 3 months ago
- tiny code to access tenstorrent blackhole ☆66 · May 26, 2025 · Updated last year
- The TT-Forge ONNX is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their p… ☆64 · Updated this week
- Buda Compiler Backend for Tenstorrent devices ☆31 · Apr 2, 2025 · Updated last year
- [ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks ☆15 · May 18, 2022 · Updated 4 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning ☆20 · Jul 7, 2022 · Updated 3 years ago
- Tenstorrent TT-BUDA Repository ☆314 · Feb 9, 2026 · Updated 4 months ago
- Tenstorrent system interface library ☆34 · Updated this week
- Tenstorrent Firmware Update Utility ☆13 · Jun 4, 2026 · Updated 2 weeks ago
- Guichan is a C++ GUI library designed for games. ☆14 · Oct 22, 2025 · Updated 7 months ago
- ☆29 · Jun 1, 2026 · Updated 2 weeks ago
- User-Mode Driver for Tenstorrent hardware ☆43 · Updated this week
- Tensara's GPU programming problems ☆20 · Apr 23, 2026 · Updated last month
- [Deprecated] ⭐️ TT-NN Compiler for PyTorch 2 ⭐️ Enables running PyTorch models on Tenstorrent hardware using eager or compile path ☆62 · Feb 24, 2026 · Updated 3 months ago
- ☆58 · Updated this week
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer ☆30 · Dec 6, 2023 · Updated 2 years ago
- ☆16 · Sep 24, 2024 · Updated last year
- My tests and experiments with some popular dl frameworks. ☆17 · Sep 11, 2025 · Updated 9 months ago
- The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize" ☆10 · Mar 22, 2023 · Updated 3 years ago
- Sample programs for the LLVM PTX back-end ☆41 · Aug 27, 2015 · Updated 10 years ago
- Examples and training code for Machine Learning samples that can be run on various Edge devices ☆10 · Jan 8, 2025 · Updated last year
- Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations" ☆42 · Nov 16, 2021 · Updated 4 years ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust. ☆13 · Nov 23, 2024 · Updated last year
- ☆30 · Jun 7, 2025 · Updated last year
- Tenstorrent Kernel Module ☆65 · Jun 11, 2026 · Updated last week
- Scripts to prepare OXFORD VGG Face dataset ☆12 · Mar 29, 2016 · Updated 10 years ago
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s… ☆281 · Updated this week
- An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications. ☆14 · Feb 3, 2025 · Updated last year
- It's a baby compiler. (Lean btw.) ☆16 · May 19, 2025 · Updated last year
- 初秋, an implementation that makes it easy to test ActivityPub implementations. ☆11 · Aug 4, 2024 · Updated last year
- Noisy language compiler ☆17 · Jul 31, 2024 · Updated last year
- Automatic virtualization of (general) accelerators. ☆47 · Nov 28, 2022 · Updated 3 years ago
- Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard! ☆267 · Updated this week
- ☆12 · May 23, 2018 · Updated 8 years ago
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100 ☆12 · Dec 30, 2023 · Updated 2 years ago