evintunador / autograd_engine_tutorialLinks
A from-scratch multi-difficulty-level tutorial on how pytorch, tensor flow, Jax, etc work
☆13Updated 11 months ago
Alternatives and similar repositories for autograd_engine_tutorial
Users that are interested in autograd_engine_tutorial are comparing it to the libraries listed below
Sorting:
- Simple Transformer in Jax☆142Updated last year
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆276Updated last year
- Gradient descent is cool and all, but what if we could delete it?☆106Updated 5 months ago
- TOPLOC: is a novel method for verifiable inference that enables users to verify that LLM providers are using the correct model configurat…☆52Updated 9 months ago
- could we make an ml stack in 100,000 lines of code?☆46Updated last year
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆829Updated 6 months ago
- A deep dive on the history of robotics and the future of humanoids☆157Updated last year
- Following Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- Frontier Models playing the board game Diplomacy.☆628Updated last month
- My runthrough of karpathy's lectures (with notes), building NN's from scratch, simple autoregressive language models, GPT models and lear…☆10Updated 2 years ago
- Incentivized Training over Wide Web with 1000x model compression.☆22Updated last year
- Tensor library with autograd using only Rust's standard library☆71Updated last year
- ☆29Updated last year
- ☆541Updated 6 months ago
- UNet diffusion model in pure CUDA☆661Updated last year
- smol models are fun too☆93Updated last year
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.☆115Updated last month
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆77Updated 11 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆198Updated 8 months ago
- Learnings and programs related to CUDA☆432Updated 7 months ago
- ComplexTensor: Machine Learning By Bridging Classical and Quantum Computation☆78Updated last year
- Asynchronous P2P communication backend for decentralized pipeline parallelism☆41Updated 8 months ago
- Alex Krizhevsky's original code from Google Code☆199Updated 9 years ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated last year
- a tiny vectorstore implementation built with numpy.☆64Updated last year
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆202Updated 2 years ago
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.☆238Updated 5 months ago
- a simplified version of Google's Gemma model to be used for learning☆26Updated last year
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆457Updated last year
- SIMD quantization kernels☆94Updated 5 months ago