karpathy / arxiv-sanity-lite
arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors based on paper abstracts.
☆1,176Updated last year
Related projects ⓘ
Alternatives and complementary repositories for arxiv-sanity-lite
- Tensors, for human consumption☆1,111Updated last week
- Cramming the training of a (BERT-type) language model into limited compute.☆1,294Updated 4 months ago
- ML Collections is a library of Python Collections designed for ML use cases.☆893Updated 3 months ago
- Notebooks and various random fun☆1,080Updated last year
- What would you do with 1000 H100s...☆892Updated 9 months ago
- Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world …☆601Updated 9 months ago
- maximal update parametrization (µP)☆1,396Updated 3 months ago
- ☆748Updated last month
- The WeightWatcher tool for predicting the accuracy of Deep Neural Networks☆1,470Updated last month
- Type annotations and dynamic checking for a tensor's shape, dtype, names, etc.☆1,400Updated 3 months ago
- Implementation of Hinton's forward-forward (FF) algorithm - an alternative to back-propagation☆1,444Updated last year
- functorch is JAX-like composable function transforms for PyTorch.☆1,395Updated this week
- A platform for managing machine learning experiments☆816Updated 3 months ago
- JAX - A curated list of resources https://github.com/google/jax☆1,538Updated 3 months ago
- 🧠 A study guide to learn about Transformers☆1,539Updated last year
- 🤖 A PyTorch library of curated Transformer models and their composable components☆864Updated 6 months ago
- ☆506Updated 9 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,673Updated last week
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆821Updated 2 years ago
- Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory☆425Updated 2 months ago
- Puzzles for exploring transformers☆321Updated last year
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆851Updated last year
- FFCV: Fast Forward Computer Vision (and other ML workloads!)☆2,857Updated 4 months ago
- Structured state space sequence models☆2,454Updated 3 months ago
- Version control for machine learning☆1,650Updated 2 months ago
- JAX-based neural network library☆2,894Updated last week
- Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.☆479Updated 2 weeks ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆965Updated 2 months ago
- 100 exercises to learn JAX☆567Updated 2 years ago
- ☆386Updated 3 weeks ago