trevorpogue / algebraic-nnhw
Algebraic enhancements for GEMM & AI accelerators
☆278 · Updated 5 months ago
Alternatives and similar repositories for algebraic-nnhw
Users who are interested in algebraic-nnhw are comparing it to the libraries listed below.
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator ☆211 · Updated last year
- ☆248 · Updated last year
- Code sample showing how to run and benchmark models on Qualcomm's Windows PCs ☆101 · Updated 10 months ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines). ☆253 · Updated last year
- Tensor library & inference framework for machine learning ☆107 · Updated this week
- Richard is gaining power ☆194 · Updated 2 months ago
- DiscoGrad - automatically differentiate across conditional branches in C++ programs ☆204 · Updated 11 months ago
- A minimal tensor processing unit (TPU), inspired by Google's TPU V2 and V1 ☆467 · Updated this week
- Sequential Logic ☆111 · Updated last week
- Docker-based inference engine for AMD GPUs ☆231 · Updated 10 months ago
- throwaway GPT inference ☆140 · Updated last year
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's D ☆98 · Updated last year
- ☆197 · Updated 3 months ago
- Lamport's Bakery Algorithm Demonstrated in Python ☆96 · Updated last year
- ☆187 · Updated 11 months ago
- ☆34 · Updated 7 months ago
- Run and explore Llama models locally with minimal dependencies on CPU ☆191 · Updated 10 months ago
- This is a numpy implementation of the Skip-gram algorithm described in Mikolov et al.'s Word2Vec paper. It is intended for didactic purpos… ☆36 · Updated 2 years ago
- PyTorch script hot swap: Change code without unloading your LLM from VRAM ☆126 · Updated 4 months ago
- C++ raytracer that supports custom models. Supports running the calculations on the CPU using C++11 threads or on the GPU via CUDA. ☆76 · Updated 2 years ago
- A BERT that you can train on a (gaming) laptop. ☆209 · Updated last year
- Run 64-bit Linux on LiteX + RocketChip ☆201 · Updated 3 weeks ago
- Revealing example of self-attention, the building block of transformer AI models ☆131 · Updated 2 years ago
- This project aims to enable language model inference on FPGAs, supporting AI applications in edge devices and environments with limited r… ☆165 · Updated last year
- Designing bridge trusses with PyTorch autograd ☆61 · Updated last year
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more. ☆105 · Updated last year
- This repo contains a new way to use bloom filters to do lossless video compression ☆248 · Updated 2 months ago
- Exploring the scalable matrix extension of the Apple M4 processor ☆194 · Updated 9 months ago
- A GPU Accelerated Binary Vector Store ☆47 · Updated 6 months ago
- A tiny autograd engine with a JAX-like API ☆74 · Updated last month