turingmotors / swanLinks
This project aims to enable language model inference on FPGAs, supporting AI applications in edge devices and environments with limited resources.
☆170Updated last year
Alternatives and similar repositories for swan
Users that are interested in swan are comparing it to the libraries listed below
Sorting:
- ☆150Updated last week
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆215Updated 2 years ago
- Algebraic enhancements for GEMM & AI accelerators☆286Updated 10 months ago
- Tenstorrent console based hardware information program☆58Updated this week
- ☆119Updated 2 years ago
- The Riallto Open Source Project from AMD☆83Updated 9 months ago
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆172Updated last year
- tiny code to access tenstorrent blackhole☆61Updated 7 months ago
- Open source machine learning accelerators☆395Updated last year
- Run 64-bit Linux on LiteX + RocketChip☆208Updated 3 months ago
- Repository of model demos using TT-Buda☆63Updated 9 months ago
- A open source reimplementation of Google's Tensor Processing Unit (TPU).☆723Updated 8 years ago
- Exocompilation for productive programming of hardware accelerators☆699Updated this week
- Tensor library & inference framework for machine learning☆118Updated 3 months ago
- Binary Neural Network Framework for FPGA(Differentiable LUT)☆169Updated 5 months ago
- A configurable RTL to bitstream FPGA toolchain☆55Updated this week
- ☆306Updated this week
- Universal Memory Interface (UMI)☆156Updated 2 weeks ago
- CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning☆294Updated this week
- GPEmu, a GPU emulator for faster and cheaper prototyping and evaluation of deep learning system research☆37Updated last year
- NNgen: A Fully-Customizable Hardware Synthesis Compiler for Deep Neural Network☆358Updated 2 years ago
- Ocelot: The Berkeley Out-of-Order Machine With V-EXT support☆208Updated 3 weeks ago
- Machine-Learning Accelerator System Exploration Tools☆186Updated this week
- Example for running IREE in a bare-metal Arm environment.☆40Updated 5 months ago
- ☆17Updated 2 months ago
- a mini 2x2 systolic array and PE demo☆66Updated 3 weeks ago
- Tenstorrent TT-BUDA Repository☆314Updated 9 months ago
- RTL logic synthesis☆123Updated 2 months ago
- Fork of LLVM to support AMD AIEngine processors☆182Updated this week
- A Heterogeneous Platform Deep Learning Compiler Framework from EdgeCortix☆33Updated last year