tgautam03 / xGeMM
Accelerated General (FP32) Matrix Multiplication
☆71Updated last week
Alternatives and similar repositories for xGeMM:
Users that are interested in xGeMM are comparing it to the libraries listed below
- pytorch from scratch in pure C/CUDA and python☆38Updated 2 months ago
- Convoluting η-dimensional tensors over abstract manifolds.☆55Updated 2 weeks ago
- ☆47Updated 4 months ago
- Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)☆52Updated 4 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆195Updated 3 weeks ago
- Visualizing some of the internals of a neural network during training and inference.☆70Updated 10 months ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆166Updated 4 months ago
- End-to-End LLM Guide☆97Updated 5 months ago
- Alex Krizhevsky's original code from Google Code☆190Updated 8 years ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆180Updated this week
- Andrej Kapathy's micrograd implemented in c☆29Updated 4 months ago
- a simple numpy alternative in C☆18Updated 2 months ago
- A simplistic linear and multiprocessed approach to sentiment analysis using Gzip Normalized Compression Distances with k nearest neighbor…☆142Updated last year
- Tensor library with autograd using only Rust's standard library☆62Updated 5 months ago
- a tiny vectorstore implementation built with numpy.☆56Updated 7 months ago
- a tiny multidimensional array implementation in C similar to numpy, but only one file.☆221Updated 4 months ago
- machine learning from absolute scratch in c. gradients, linear algebra ops & everything else without using any third party library!☆20Updated 4 months ago
- Solve puzzles to improve your tinygrad skills!☆91Updated 2 months ago
- ☆160Updated last month
- Documented and Unit Tested educational Deep Learning framework with Autograd from scratch.☆105Updated 8 months ago
- A MNIST neural network written from scratch in Odin, visualised with Raylib☆161Updated 2 months ago
- A really tiny autograd engine☆88Updated 8 months ago
- learningggggggg 🐳☆125Updated 2 weeks ago
- Everything you need to know about Transformers! 🤖☆127Updated last year
- ☆83Updated last week
- a Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization in pure C.☆21Updated 5 months ago
- Slides, notes, and materials for the workshop☆308Updated 6 months ago
- System built to find your lookalike with AI☆36Updated 7 months ago
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆229Updated last year
- ☆63Updated 2 months ago