trevorpogue / algebraic-nnhw
Algebraic enhancements for GEMM & AI accelerators
☆284Updated 9 months ago
Alternatives and similar repositories for algebraic-nnhw
Users who are interested in algebraic-nnhw are comparing it to the libraries listed below
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆215Updated 2 years ago
- Code sample showing how to run and benchmark models on Qualcomm's Windows PCs☆104Updated last year
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆209Updated last year
- ☆249Updated last year
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).☆254Updated 2 years ago
- Tensor library & inference framework for machine learning☆115Updated 2 months ago
- Docker-based inference engine for AMD GPUs☆230Updated last year
- Richard is gaining power☆200Updated 6 months ago
- Sequential Logic☆114Updated last week
- ☆36Updated 3 months ago
- ☆191Updated last year
- ☆199Updated 7 months ago
- throwaway GPT inference☆141Updated last year
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's D☆100Updated last year
- Lamport's Bakery Algorithm Demonstrated in Python☆95Updated last year
- Agent Based Model on GPU using CUDA 12.2.1 and OpenGL 4.5 (CUDA OpenGL interop) on Windows/Linux☆75Updated 9 months ago
- Run and explore Llama models locally with minimal dependencies on CPU☆190Updated last year
- C++ raytracer that supports custom models. Supports running the calculations on the CPU using C++11 threads or in the GPU via CUDA.☆74Updated 3 years ago
- This project aims to enable language model inference on FPGAs, supporting AI applications in edge devices and environments with limited r…☆169Updated last year
- Revealing example of self-attention, the building block of transformer AI models☆131Updated 2 years ago
- A BERT that you can train on a (gaming) laptop.☆210Updated 2 years ago
- Hierarchical Navigable Small Worlds☆101Updated 4 months ago
- ☆126Updated 6 months ago
- R.L. methods and techniques.☆199Updated last year
- This repo contains a new way to use bloom filters to do lossless video compression☆250Updated 6 months ago
- Experiments with applying Fourier transforms to various plane-filling curves and patterns☆66Updated 2 years ago
- ☆254Updated 2 years ago
- Optimally allocate poker chips using constrained, nonlinear optimization☆174Updated last year
- What impact does floating point precision have on Mandelbrot set calculations?☆109Updated 2 years ago
- CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning☆225Updated last week