stanford-cs336 / assignment1-basicsLinks
Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆581Updated 3 weeks ago
Alternatives and similar repositories for assignment1-basics
Users that are interested in assignment1-basics are comparing it to the libraries listed below
Sorting:
- ☆1,072Updated last month
- ☆349Updated 8 months ago
- ☆192Updated 7 months ago
- Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch☆58Updated last month
- LLaMA 2 implemented from scratch in PyTorch☆347Updated last year
- ☆92Updated 11 months ago
- An ML Systems Onboarding list☆877Updated 7 months ago
- Learning material for CMU10-714: Deep Learning System☆268Updated last year
- All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai☆177Updated last year
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,693Updated last month
- My implementation of Stanford CS336 assignments.☆106Updated last month
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/☆64Updated 4 months ago
- A repository sharing the literatures about large language models☆100Updated last month
- Solutions for CS224n (2022)☆66Updated last year
- ☆362Updated 4 months ago
- An extension of the nanoGPT repository for training small MOE models.☆178Updated 5 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆520Updated last month
- Textbook on reinforcement learning from human feedback☆1,185Updated last week
- Building blocks for foundation models.☆532Updated last year
- GPU Kernels☆193Updated 4 months ago
- Puzzles for learning Triton, play it with minimal environment configuration!☆489Updated 8 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,537Updated 4 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆391Updated 5 months ago
- Notes and commented code for RLHF (PPO)☆104Updated last year
- ☆18Updated last month
- ☆39Updated 5 months ago
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆269Updated 7 months ago
- slime is a LLM post-training framework aiming for RL Scaling.☆1,420Updated this week
- Puzzles for learning Triton☆1,948Updated 9 months ago
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw☆527Updated 8 months ago