tenstorrent / tt-buda-demos
Repository of model demos using TT-Buda
☆55Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for tt-buda-demos
- Tenstorrent TT-BUDA Repository☆225Updated last month
- ⭐️ TTNN Compiler for PyTorch 2.0 ⭐️ It enables running PyTorch2.0 models on Tenstorrent hardware☆25Updated this week
- TVM for Tenstorrent ASICs☆20Updated this week
- Tenstorrent MLIR compiler☆76Updated this week
- Development repository for the Triton language and compiler☆93Updated this week
- Repository for AI model benchmarking.☆11Updated this week
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…☆20Updated this week
- Buda Compiler Backend for Tenstorrent devices☆26Updated last month
- TT-NN operator library, and TT-Metalium low level kernel programming model.☆475Updated this week
- Attention in SRAM on Tenstorrent Grayskull☆29Updated 4 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆35Updated 6 months ago
- The Riallto Open Source Project from AMD☆68Updated last week
- IREE's PyTorch Frontend, based on Torch Dynamo.☆55Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆95Updated this week
- Tenstorrent console based hardware information program☆23Updated 2 weeks ago
- ☆39Updated 2 months ago
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆100Updated 3 weeks ago
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona…☆63Updated this week
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆163Updated last month
- Tenstorrent Kernel Module☆32Updated last week
- An experimental CPU backend for Triton☆56Updated last week
- Tenstorrent system interface library☆14Updated 2 weeks ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆89Updated this week
- ☆152Updated this week
- OpenAI Triton backend for Intel® GPUs☆143Updated this week
- Repository for the QUIK project, enabling the use of 4bit kernels for generative inference - EMNLP 2024☆173Updated 7 months ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆100Updated 11 months ago
- Ahead of Time (AOT) Triton Math Library☆41Updated this week
- A lightweight MLIR Python frontend with support for PyTorch☆21Updated 2 months ago
- ☆58Updated last year