karpathy / randomfunLinks

Notebooks and various random fun

☆1,097

Alternatives and similar repositories for randomfun

Users that are interested in randomfun are comparing it to the libraries listed below

Sorting:

tysam-code / hlb-CIFAR10
Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)
☆1,271Updated 7 months ago
explosion / curated-transformers
🤖 A PyTorch library of curated Transformer models and their composable components
☆892Updated last year
HazyResearch / meerkat
Creative interactive views of any dataset.
☆843Updated 7 months ago
karpathy / lecun1989-repro
Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world …
☆642Updated last year
JonasGeiping / cramming
Cramming the training of a (BERT-type) language model into limited compute.
☆1,339Updated last year
karpathy / arxiv-sanity-lite
arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors …
☆1,324Updated 2 years ago
PiotrNawrot / nanoT5
Fast & Simple repository for pre-training and fine-tuning T5-style models
☆1,006Updated 11 months ago
Kaixhin / grokking-pytorch
The Hitchiker's Guide to PyTorch
☆1,198Updated 3 years ago
osanseviero / ml_timeline
☆590Updated 2 years ago
keras-team / keras-core
A multi-backend implementation of the Keras API, with support for TensorFlow, JAX, and PyTorch.
☆1,272Updated last month
markriedl / transformer-walkthrough
A walkthrough of transformer architecture code
☆351Updated last year
google-deepmind / xmanager
A platform for managing machine learning experiments
☆860Updated last week
fastai / course22p2
course.fast.ai 2022 part 2
☆500Updated last year
srush / Autodiff-Puzzles
☆440Updated 9 months ago
xl0 / lovely-tensors
Tensors, for human consumption
☆1,270Updated last month
keerthanpg / talktopapers
☆211Updated 2 years ago
dair-ai / Transformers-Recipe
🧠 A study guide to learn about Transformers
☆1,600Updated 2 years ago
sanjeevanahilan / nanoChatGPT
A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick
☆291Updated last year
jacobhilton / deep_learning_curriculum
Language model alignment-focused deep learning curriculum
☆1,431Updated 11 months ago
srush / LLM-Training-Puzzles
What would you do with 1000 H100s...
☆1,064Updated last year
srush / raspy
An interactive exploration of Transformer programming.
☆265Updated last year
tysam-code / hlb-gpt
Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…
☆349Updated 11 months ago
fchollet / stable-diffusion-tensorflow
TensorFlow/Keras port of Stable Diffusion
☆321Updated 2 years ago
evanmiller / LLM-Reading-List
LLM papers I'm reading, mostly on inference and model compression
☆735Updated last year
karpathy / transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
☆131Updated 3 years ago
srush / MiniChain
A tiny library for coding with large language models.
☆1,234Updated last year
EleutherAI / cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
☆808Updated last week
CalculatedContent / WeightWatcher
The WeightWatcher tool for predicting the accuracy of Deep Neural Networks
☆1,626Updated last month
HazyResearch / safari
Convolutions for Sequence Modeling
☆893Updated last year
abacaj / fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
☆715Updated last year