turingmotors / swanLinks
This project aims to enable language model inference on FPGAs, supporting AI applications in edge devices and environments with limited resources.
☆166Updated last year
Alternatives and similar repositories for swan
Users that are interested in swan are comparing it to the libraries listed below
Sorting:
- Algebraic enhancements for GEMM & AI accelerators☆279Updated 7 months ago
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆213Updated last year
- The Riallto Open Source Project from AMD☆83Updated 5 months ago
- ☆103Updated last year
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆162Updated last year
- Run 64-bit Linux on LiteX + RocketChip☆203Updated 2 months ago
- ☆289Updated this week
- A open source reimplementation of Google's Tensor Processing Unit (TPU).☆706Updated 7 years ago
- A configurable RTL to bitstream FPGA toolchain☆43Updated last week
- Binary Neural Network Framework for FPGA(Differentiable LUT)☆161Updated last month
- Tenstorrent console based hardware information program☆54Updated this week
- tiny code to access tenstorrent blackhole☆59Updated 4 months ago
- Universal Memory Interface (UMI)☆152Updated this week
- Ocelot: The Berkeley Out-of-Order Machine With V-EXT support☆178Updated this week
- A minimal tensor processing unit (TPU), inspired by Google's TPU V2 and V1☆938Updated last month
- Open source machine learning accelerators☆388Updated last year
- Exocompilation for productive programming of hardware accelerators☆661Updated this week
- Machine-Learning Accelerator System Exploration Tools☆178Updated this week
- Fork of LLVM to support AMD AIEngine processors☆164Updated this week
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1.☆183Updated last year
- NNgen: A Fully-Customizable Hardware Synthesis Compiler for Deep Neural Network☆356Updated last year
- Repository of model demos using TT-Buda☆62Updated 5 months ago
- Example for running IREE in a bare-metal Arm environment.☆40Updated 2 months ago
- ☆16Updated 3 months ago
- A C++ to Verilog translation tool with some basic guarantees that your code will work.☆173Updated 7 months ago
- Tensor library & inference framework for machine learning☆110Updated this week
- A survey on Hardware Accelerated LLMs☆59Updated 8 months ago
- Code sample showing how to run and benchmark models on Qualcomm's Window PCs☆101Updated 11 months ago
- A high-efficiency system-on-chip for floating-point compute workloads.☆43Updated 8 months ago
- A Heterogeneous Platform Deep Learning Compiler Framework from EdgeCortix☆33Updated last year