tenstorrent / tt-umd
User-Mode Driver for Tenstorrent hardware
☆14Updated this week
Alternatives and similar repositories for tt-umd:
Users that are interested in tt-umd are comparing it to the libraries listed below
- Main Repo for the OpenHW Group Software Task Group☆15Updated 2 weeks ago
- Example for running IREE in a bare-metal Arm environment.☆26Updated 2 weeks ago
- The Riallto Open Source Project from AMD☆71Updated 2 months ago
- Heterogeneous Cluster Interconnect to bind special-purpose HW accelerators with general-purpose cluster cores☆12Updated last week
- This project records the process of optimizing SGEMM (single-precision floating point General Matrix Multiplication) on the riscv platfor…☆18Updated last month
- 2-8bit weights, 8-bit activations flexible Neural Processing Engine for PULP clusters☆19Updated 3 months ago
- Tenstorrent system interface library☆14Updated last month
- ☆33Updated 6 months ago
- Quite OK image compression Verilog implementation☆19Updated 2 months ago
- ☆15Updated 4 months ago
- ☆22Updated last month
- Lake is a framework for generating synthesizable memory modules from a high-level behavioral specification and widely-available memory ma…☆20Updated this week
- Following the RISC-V IME extension standard, and reusing Vector register resources, these instructions can bring more than a tenfold perf…☆46Updated 5 months ago
- Attention in SRAM on Tenstorrent Grayskull☆31Updated 6 months ago
- FPGA acceleration of arbitrary precision floating point computations.☆38Updated 2 years ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆38Updated this week
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆36Updated 3 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆49Updated last year
- Bandwidth test for ROCm☆53Updated this week
- The ISA specification for the ZiCondOps extension.☆19Updated 10 months ago
- RISC-V GPGPU☆34Updated 4 years ago
- Wrapper shells enabling designs generated by rocket-chip to map onto certain FPGA boards☆16Updated 2 months ago
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆16Updated 2 years ago
- rocWMMA☆100Updated this week
- ☆24Updated 2 years ago
- A high-efficiency system-on-chip for floating-point compute workloads.☆26Updated 2 weeks ago
- Tenstorrent Kernel Module☆35Updated this week
- Buda Compiler Backend for Tenstorrent devices☆26Updated 4 months ago
- A tiny FP8 multiplication unit written in Verilog. TinyTapeout 2 submission.☆14Updated 2 years ago
- a clone of POCL that includes RISC-V newlib devices support and Vortex☆38Updated 7 months ago