karpathy / randomfun
Notebooks and various random fun
☆1,096Updated 2 years ago
Alternatives and similar repositories for randomfun:
Users that are interested in randomfun are comparing it to the libraries listed below
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)☆1,253Updated 4 months ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,331Updated 10 months ago
- 🤖 A PyTorch library of curated Transformer models and their composable components☆885Updated last year
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆122Updated 2 years ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆1,003Updated 8 months ago
- ☆589Updated last year
- Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.☆1,559Updated last week
- Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world …☆635Updated last year
- arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors …☆1,278Updated last year
- Puzzles for exploring transformers☆344Updated 2 years ago
- What would you do with 1000 H100s...☆1,043Updated last year
- Creative interactive views of any dataset.☆838Updated 4 months ago
- ☆431Updated 6 months ago
- An interactive exploration of Transformer programming.☆263Updated last year
- Aspires to help the influx of bioRxiv / medRxiv papers on COVID-19☆362Updated 5 years ago
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆345Updated 9 months ago
- Fine-tune mistral-7B on 3090s, a100s, h100s☆711Updated last year
- ☆602Updated last year
- A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick☆289Updated last year
- Language Modeling with the H3 State Space Model☆520Updated last year
- ☆532Updated last year
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…☆1,566Updated last year
- Tensors, for human consumption☆1,249Updated 5 months ago
- A platform for managing machine learning experiments☆848Updated 2 weeks ago
- ☆347Updated last week
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,774Updated last week
- Convolutions for Sequence Modeling☆883Updated 10 months ago
- Pure Python from-scratch zero-dependency implementation of Bitcoin for educational purposes☆1,723Updated 3 years ago
- Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript☆576Updated 10 months ago
- Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)☆1,084Updated last month