eduardoleao052 / Autograd-from-scratch
Documented and Unit Tested educational Deep Learning framework with Autograd from scratch.
☆111Updated 11 months ago
Alternatives and similar repositories for Autograd-from-scratch:
Users that are interested in Autograd-from-scratch are comparing it to the libraries listed below
- Highly commented implementations of Transformers in PyTorch☆132Updated last year
- Simplified implementation of UMAP like dimensionality reduction algorithm☆47Updated 4 months ago
- ML/DL Math and Method notes☆59Updated last year
- Implementation snake game based on Diffusion model☆88Updated 2 months ago
- A deep learning framework created from scratch with Python and NumPy☆236Updated 2 years ago
- A really tiny autograd engine☆90Updated 11 months ago
- Autograd to GPT-2 completely from scratch☆112Updated 2 weeks ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆198Updated 10 months ago
- Functional local implementations of main model parallelism approaches☆95Updated 2 years ago
- A pure NumPy implementation of Mamba.☆220Updated 8 months ago
- Annotated version of the Mamba paper☆477Updated last year
- ☆149Updated 7 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆167Updated this week
- Fast bare-bones BPE for modern tokenizer training☆151Updated 5 months ago
- A Jax-based library for designing and training transformer models from scratch.☆282Updated 6 months ago
- Best practices & guides on how to write distributed pytorch training code☆377Updated last month
- A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different…☆282Updated 3 months ago
- ☆103Updated 7 months ago
- ☆138Updated 2 months ago
- Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)☆146Updated 9 months ago
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆87Updated last year
- Triton implementation of GPT/LLAMA☆16Updated 7 months ago
- A c/c++ implementation of micrograd: a tiny autograd engine with neural net on top.☆65Updated last year
- ☆38Updated 2 weeks ago
- I learn about and explain quantization☆26Updated 11 months ago
- This repository contains a better implementation of Kolmogorov-Arnold networks☆61Updated 10 months ago
- Notebooks for the "JAX in Action" book☆134Updated 9 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆98Updated 3 months ago
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆115Updated 10 months ago
- Custom kernels in Triton language for accelerating LLMs☆18Updated 11 months ago