turingmotors / swanLinks
This project aims to enable language model inference on FPGAs, supporting AI applications in edge devices and environments with limited resources.
☆169Updated last year
Alternatives and similar repositories for swan
Users that are interested in swan are comparing it to the libraries listed below
Sorting:
- ☆130Updated this week
- Algebraic enhancements for GEMM & AI accelerators☆286Updated 10 months ago
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆215Updated 2 years ago
- tiny code to access tenstorrent blackhole☆61Updated 7 months ago
- Tenstorrent console based hardware information program☆58Updated this week
- ☆117Updated last year
- Run 64-bit Linux on LiteX + RocketChip☆208Updated 2 months ago
- The Riallto Open Source Project from AMD☆83Updated 8 months ago
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆172Updated last year
- Open source machine learning accelerators☆394Updated last year
- ☆306Updated this week
- Universal Memory Interface (UMI)☆156Updated 2 weeks ago
- A configurable RTL to bitstream FPGA toolchain☆55Updated 3 weeks ago
- A open source reimplementation of Google's Tensor Processing Unit (TPU).☆721Updated 8 years ago
- a mini 2x2 systolic array and PE demo☆66Updated 2 weeks ago
- ☆17Updated 2 months ago
- Exocompilation for productive programming of hardware accelerators☆696Updated last week
- Repository of model demos using TT-Buda☆63Updated 9 months ago
- Tensor library & inference framework for machine learning☆118Updated 3 months ago
- CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning☆294Updated this week
- GPEmu, a GPU emulator for faster and cheaper prototyping and evaluation of deep learning system research☆36Updated last year
- Ocelot: The Berkeley Out-of-Order Machine With V-EXT support☆206Updated 3 weeks ago
- Machine-Learning Accelerator System Exploration Tools☆186Updated this week
- Binary Neural Network Framework for FPGA(Differentiable LUT)☆169Updated 4 months ago
- A minimal tensor processing unit (TPU), inspired by Google's TPU V2 and V1☆1,102Updated 4 months ago
- RTL logic synthesis☆123Updated 2 months ago
- A C++ to Verilog translation tool with some basic guarantees that your code will work.☆176Updated 10 months ago
- Buda Compiler Backend for Tenstorrent devices☆30Updated 9 months ago
- ☆199Updated 8 months ago
- Code sample showing how to run and benchmark models on Qualcomm's Window PCs☆104Updated last year