tenstorrent / tensix-isa-simulator
☆17Updated 2 weeks ago
Alternatives and similar repositories for tensix-isa-simulator:
Users that are interested in tensix-isa-simulator are comparing it to the libraries listed below
- Attention in SRAM on Tenstorrent Grayskull☆32Updated 8 months ago
- User-Mode Driver for Tenstorrent hardware☆16Updated this week
- Tenstorrent Kernel Module☆39Updated this week
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…☆30Updated this week
- Tenstorrent MLIR compiler☆107Updated this week
- Tenstorrent system interface library☆16Updated last week
- GPUOcelot: A dynamic compilation framework for PTX☆182Updated last month
- The Riallto Open Source Project from AMD☆75Updated 4 months ago
- tenstorrent kernel from twitch☆27Updated last year
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated last week
- Buda Compiler Backend for Tenstorrent devices☆28Updated last month
- ☆56Updated last week
- Repository of model demos using TT-Buda☆63Updated 2 weeks ago
- Tenstorrent console based hardware information program☆35Updated last week
- Example for running IREE in a bare-metal Arm environment.☆33Updated last month
- ☆83Updated this week
- ROCm Systems Profiler☆16Updated this week
- Bandwidth test for ROCm☆54Updated 2 weeks ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆74Updated this week
- Custom PTX Instruction Benchmark☆120Updated last month
- Fork of LLVM to support AMD AIEngine processors☆129Updated this week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆40Updated 2 weeks ago
- The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm☆37Updated this week
- Library for modelling performance costs of different Neural Network workloads on NPU devices☆32Updated this week
- ☆12Updated last week
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆60Updated 2 weeks ago
- GPTPU for SC 2021☆51Updated 2 years ago
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆84Updated this week
- ☆45Updated this week
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆117Updated 2 months ago