turingmotors/swan
This project enables language model inference on FPGAs, supporting AI applications on edge devices and in other resource-constrained environments.
☆137 · Updated 6 months ago
Related projects
Alternatives and complementary repositories for swan
- Deep learning accelerator architectures requiring half the multipliers ☆263 · Updated 7 months ago
- Run 64-bit Linux on LiteX + RocketChip ☆188 · Updated 3 months ago
- Sequential Logic ☆99 · Updated this week
- A configurable RTL-to-bitstream FPGA toolchain ☆256 · Updated this week
- Code sample showing how to run and benchmark models on Qualcomm's Windows PCs ☆87 · Updated last month
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator ☆207 · Updated 11 months ago
- Repository for the QUIK project, enabling the use of 4-bit kernels for generative inference (EMNLP 2024) ☆173 · Updated 7 months ago
- Exploring the Scalable Matrix Extension of the Apple M4 processor ☆135 · Updated 2 weeks ago
- Open-source machine learning accelerators ☆360 · Updated 8 months ago
- Veryl: A Modern Hardware Description Language ☆513 · Updated this week
- Minimal yet working VPN daemon for Linux ☆107 · Updated last week
- ☆74 · Updated 10 months ago
- Tiny ASIC implementation of the matrix multiplication unit from "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits" ☆111 · Updated 7 months ago
- Tenstorrent console-based hardware information program ☆23 · Updated 2 weeks ago
- The Riallto Open Source Project from AMD ☆69 · Updated last week
- An implementation of bucketMul LLM inference ☆214 · Updated 4 months ago
- Fork of LLVM to support AMD AIEngine processors ☆107 · Updated this week
- Universal Memory Interface (UMI) ☆142 · Updated last week
- Repository of model demos using TT-Buda ☆55 · Updated 3 weeks ago
- 🪝 "mnist" in 60 lines of code, no dependencies. For educational purposes. ☆31 · Updated 4 months ago
- 256,000,000+ points per plot, 60+ FPS on a shitty laptop. The only limit is the size of your RAM. ☆144 · Updated last week
- Absolute minimalistic implementation of a GPT-like transformer using only NumPy (<650 lines) ☆250 · Updated last year
- Binary Neural Network framework for FPGA (differentiable LUT) ☆139 · Updated last week
- A CLI to manage, install, and configure llama inference implementations in multiple languages ☆65 · Updated 10 months ago
- ☆234 · Updated 8 months ago
- NNgen: A Fully-Customizable Hardware Synthesis Compiler for Deep Neural Networks ☆339 · Updated last year
- Run and explore Llama models locally with minimal dependencies on CPU ☆183 · Updated last month
- ☆179 · Updated 2 months ago
- ☆224 · Updated last month
- Transformer GPU VRAM estimator ☆40 · Updated 7 months ago