karpathy / randomfun
Notebooks and various random fun
☆1,094 · Updated last year
Alternatives and similar repositories for randomfun:
Users who are interested in randomfun are comparing it to the repositories listed below:
- 🤖 A PyTorch library of curated Transformer models and their composable components ☆884 · Updated last year
- arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors … ☆1,265 · Updated last year
- 🧠 A study guide to learn about Transformers ☆1,573 · Updated last year
- Tensors, for human consumption ☆1,214 · Updated 4 months ago
- What would you do with 1000 H100s... ☆1,035 · Updated last year
- Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world … ☆631 · Updated last year
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!) ☆1,251 · Updated 3 months ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models ☆1,000 · Updated 7 months ago
- Pure Python from-scratch zero-dependency implementation of Bitcoin for educational purposes ☆1,711 · Updated 3 years ago
- ☆589 · Updated last year
- An interactive exploration of Transformer programming. ☆262 · Updated last year
- Cramming the training of a (BERT-type) language model into limited compute. ☆1,327 · Updated 10 months ago
- A multi-backend implementation of the Keras API, with support for TensorFlow, JAX, and PyTorch. ☆1,267 · Updated 3 weeks ago
- Puzzles for exploring transformers ☆342 · Updated last year
- An autoregressive character-level language model for making more things ☆3,005 · Updated 10 months ago
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi… ☆342 · Updated 8 months ago
- Fine-tune mistral-7B on 3090s, a100s, h100s ☆709 · Updated last year
- Language model alignment-focused deep learning curriculum ☆1,370 · Updated 7 months ago
- ☆428 · Updated 5 months ago
- Aspires to help the influx of bioRxiv / medRxiv papers on COVID-19 ☆362 · Updated 4 years ago
- A walkthrough of transformer architecture code ☆338 · Updated last year
- ☆3,917 · Updated last year
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc. ☆103 · Updated 6 years ago
- ☆530 · Updated last year
- fast vector database made in numpy ☆752 · Updated 11 months ago
- MinT: Minimal Transformer Library and Tutorials ☆253 · Updated 2 years ago
- The "tl;dr" on a few notable transformer papers (pre-2022). ☆190 · Updated 2 years ago
- A library for mechanistic interpretability of GPT-style language models ☆2,061 · Updated this week
- The WeightWatcher tool for predicting the accuracy of Deep Neural Networks ☆1,570 · Updated 7 months ago
- If tinygrad wasn't small enough for you... ☆710 · Updated last year