stanford-cs336 / spring2024-assignment2-systemsLinks
☆16Updated last year
Alternatives and similar repositories for spring2024-assignment2-systems
Users that are interested in spring2024-assignment2-systems are comparing it to the libraries listed below
Sorting:
- ☆349Updated 7 months ago
- ☆211Updated 6 months ago
- ☆444Updated 10 months ago
- ☆526Updated last year
- ☆60Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆190Updated 2 months ago
- Puzzles for exploring transformers☆366Updated 2 years ago
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.☆221Updated 2 weeks ago
- ☆92Updated 11 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆256Updated last year
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs☆523Updated this week
- ☆380Updated this week
- A puzzle to learn about prompting☆132Updated 2 years ago
- An example starter repo for Python projects☆295Updated 2 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆812Updated 3 weeks ago
- Best practices & guides on how to write distributed pytorch training code☆467Updated 6 months ago
- What would you do with 1000 H100s...☆1,087Updated last year
- An interactive exploration of Transformer programming.☆269Updated last year
- ☆42Updated 7 months ago
- Building blocks for foundation models.☆532Updated last year
- ☆162Updated last year
- Highly commented implementations of Transformers in PyTorch☆137Updated 2 years ago
- An extension of the nanoGPT repository for training small MOE models.☆178Updated 5 months ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆137Updated last year
- ☆98Updated 2 weeks ago
- 🧠 Starter templates for doing interpretability research☆73Updated 2 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆643Updated last week
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.☆250Updated 2 weeks ago
- nanoGPT-like codebase for LLM training☆102Updated 3 months ago
- Collection of my assignments and work in the class MATH51 at Stanford☆97Updated 10 years ago