mlecauchois / micrograd-cuda
☆234Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for micrograd-cuda
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆204Updated 2 months ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).☆250Updated last year
- Deep learning accelerator architectures requiring half the multipliers☆263Updated 7 months ago
- A BERT that you can train on a (gaming) laptop.☆211Updated last year
- An implementation of bucketMul LLM inference☆214Updated 4 months ago
- throwaway GPT inference☆139Updated 5 months ago
- A pure NumPy implementation of Mamba.☆216Updated 4 months ago
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆322Updated 5 months ago
- Grandmaster-Level Chess Without Search☆488Updated last month
- Open weights language model from Google DeepMind, based on Griffin.☆607Updated 4 months ago
- High-Performance FP32 Matrix Multiplication on CPU☆301Updated this week
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆271Updated this week
- Richard is gaining power☆176Updated 3 months ago
- Flash Attention in ~100 lines of CUDA (forward pass only)☆626Updated 7 months ago
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆207Updated 11 months ago
- ☆223Updated last month
- Visualize the intermediate output of Mistral 7B☆313Updated 9 months ago
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's D☆95Updated 8 months ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆193Updated this week
- Run and explore Llama models locally with minimal dependencies on CPU☆183Updated last month
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.☆350Updated 2 months ago
- Automated, smooth, N'th order derivatives of non-uniformly sampled time series data☆219Updated last month
- Solve puzzles. Learn CUDA.☆61Updated 11 months ago
- ☆179Updated 2 months ago
- Lamport's Bakery Algorithm Demonstrated in Python☆95Updated 10 months ago
- GGUF implementation in C as a library and a tools CLI program☆244Updated 4 months ago
- Diffusion on syntax trees for program synthesis☆420Updated 4 months ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆106Updated 11 months ago
- A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different…☆270Updated 3 weeks ago
- Designing bridge trusses with Pytorch autograd☆61Updated 9 months ago