turingmotors / swanLinks
This project aims to enable language model inference on FPGAs, supporting AI applications in edge devices and environments with limited resources.
☆169Updated last year
Alternatives and similar repositories for swan
Users that are interested in swan are comparing it to the libraries listed below
Sorting:
- Algebraic enhancements for GEMM & AI accelerators☆282Updated 9 months ago
- ☆111Updated last year
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆214Updated last year
- The Riallto Open Source Project from AMD☆82Updated 7 months ago
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆170Updated last year
- Tenstorrent console based hardware information program☆57Updated this week
- Open source machine learning accelerators☆392Updated last year
- A open source reimplementation of Google's Tensor Processing Unit (TPU).☆707Updated 7 years ago
- Tensor library & inference framework for machine learning☆113Updated last month
- Run 64-bit Linux on LiteX + RocketChip☆205Updated last month
- ☆302Updated this week
- Exocompilation for productive programming of hardware accelerators☆683Updated last week
- tiny code to access tenstorrent blackhole☆61Updated 6 months ago
- GPEmu, a GPU emulator for faster and cheaper prototyping and evaluation of deep learning system research☆34Updated 11 months ago
- A configurable RTL to bitstream FPGA toolchain☆54Updated this week
- A minimal tensor processing unit (TPU), inspired by Google's TPU V2 and V1☆991Updated 3 months ago
- Repository of model demos using TT-Buda☆63Updated 7 months ago
- Binary Neural Network Framework for FPGA(Differentiable LUT)☆165Updated 3 months ago
- A Heterogeneous Platform Deep Learning Compiler Framework from EdgeCortix☆33Updated last year
- Machine-Learning Accelerator System Exploration Tools☆183Updated 3 weeks ago
- NNgen: A Fully-Customizable Hardware Synthesis Compiler for Deep Neural Network☆357Updated 2 years ago
- Universal Memory Interface (UMI)☆154Updated this week
- ☆16Updated last month
- Fork of LLVM to support AMD AIEngine processors☆174Updated this week
- Ocelot: The Berkeley Out-of-Order Machine With V-EXT support☆196Updated 3 weeks ago
- Example for running IREE in a bare-metal Arm environment.☆39Updated 4 months ago
- RDNA3 emulator☆55Updated 7 months ago
- A survey on Hardware Accelerated LLMs☆59Updated 10 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C☆53Updated 8 months ago
- Code sample showing how to run and benchmark models on Qualcomm's Window PCs☆103Updated last year