trevorpogue / algebraic-nnhwLinks
Algebraic enhancements for GEMM & AI accelerators
☆277Updated 4 months ago
Alternatives and similar repositories for algebraic-nnhw
Users that are interested in algebraic-nnhw are comparing it to the libraries listed below
Sorting:
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆211Updated last year
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).☆252Updated last year
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆203Updated 10 months ago
- Code sample showing how to run and benchmark models on Qualcomm's Window PCs☆100Updated 9 months ago
- ☆248Updated last year
- Sequential Logic☆111Updated this week
- Tensor library & inference framework for machine learning☆101Updated this week
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's D☆97Updated last year
- Docker-based inference engine for AMD GPUs☆231Updated 9 months ago
- ☆196Updated 2 months ago
- Richard is gaining power☆192Updated 3 weeks ago
- Lamport's Bakery Algorithm Demonstrated in Python☆96Updated last year
- throwaway GPT inference☆140Updated last year
- ☆188Updated 10 months ago
- ☆252Updated last year
- This is a numpy implementation of the Skip-gram algorithm described in Mikolov et al's Word2Vec paper. It is intended for didactic purpos…☆36Updated 2 years ago
- This project aims to enable language model inference on FPGAs, supporting AI applications in edge devices and environments with limited r…☆162Updated last year
- A BERT that you can train on a (gaming) laptop.☆209Updated last year
- ☆34Updated 5 months ago
- Revealing example of self-attention, the building block of transformer AI models☆131Updated 2 years ago
- Run and explore Llama models locally with minimal dependencies on CPU☆191Updated 9 months ago
- R.L. methods and techniques.☆196Updated 7 months ago
- Designing bridge trusses with Pytorch autograd☆61Updated last year
- Agent Based Model on GPU using CUDA 12.2.1 and OpenGL 4.5 (CUDA OpenGL interop) on Windows/Linux☆74Updated 4 months ago
- This repo contains a new way to use bloom filters to do lossless video compression☆245Updated last month
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆126Updated 2 months ago
- Heirarchical Navigable Small Worlds☆97Updated 3 months ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆105Updated last year
- C++ raytracer that supports custom models. Supports running the calculations on the CPU using C++11 threads or in the GPU via CUDA.☆75Updated 2 years ago
- ✨ rudimentary simulation of the three-body problem☆154Updated last year