karpathy / micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
☆10,434Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for micrograd
- An autoregressive character-level language model for making more things☆2,582Updated 5 months ago
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆26,748Updated this week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,168Updated 4 months ago
- Inference Llama 2 in one file of pure C☆17,444Updated 3 months ago
- LLM training in simple, raw C/CUDA☆24,335Updated last month
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆37,189Updated 2 months ago
- ☆3,554Updated 9 months ago
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆20,105Updated 2 months ago
- Neural Networks: Zero to Hero☆11,846Updated 2 months ago
- An unnecessarily tiny implementation of GPT-2 in NumPy.☆3,236Updated last year
- Video+code lecture on building nanoGPT from scratch☆3,573Updated 2 months ago
- Tensor library for machine learning☆11,160Updated this week
- Pure Python from-scratch zero-dependency implementation of Bitcoin for educational purposes☆1,606Updated 3 years ago
- Explanation to key concepts in ML☆7,302Updated this week
- Solve puzzles. Improve your pytorch.☆3,259Updated 3 months ago
- Machine Learning Engineering Open Book☆11,589Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆7,835Updated 6 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆10,590Updated this week
- MLX: An array framework for Apple silicon☆17,040Updated this week
- Train transformer language models with reinforcement learning.☆9,967Updated this week
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆6,650Updated 3 months ago
- Development repository for the Triton language and compiler☆13,311Updated this week
- ManimML is a project focused on providing animations and visualizations of common machine learning concepts with the Manim Community Libr…☆2,393Updated 4 months ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆5,988Updated 2 months ago
- Notebooks and various random fun☆1,080Updated last year
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆12,312Updated last month
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,647Updated 3 weeks ago
- Some ipython notebooks implementing AI algorithms☆1,267Updated last month
- 🧠 A study guide to learn about Transformers☆1,539Updated last year
- arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors …☆1,176Updated last year