stanford-cs336 / assignment1-basics
Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆1,003 Updated 3 months ago
Alternatives and similar repositories for assignment1-basics
Users who are interested in assignment1-basics are comparing it to the libraries listed below.
- ☆2,281 Updated 3 weeks ago
- ☆403 Updated last year
- My implementation of Stanford CS336 assignments. ☆210 Updated 5 months ago
- Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch ☆140 Updated 5 months ago
- ☆90 Updated 5 months ago
- A Survey of Reinforcement Learning for Large Reasoning Models ☆2,178 Updated last month
- Learning material for CMU10-714: Deep Learning System ☆290 Updated last year
- ☆89 Updated 5 months ago
- A repository sharing literature about large language models ☆107 Updated this week
- ☆1,087 Updated last week
- slime is an LLM post-training framework for RL Scaling. ☆2,911 Updated last week
- My solutions and supplemental resources for CS224N in the spring of 2024. ☆32 Updated last year
- Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2… ☆267 Updated last year
- Textbook on reinforcement learning from human feedback ☆1,364 Updated this week
- Assignment 1 for Stanford CS336 - Language Modeling From Scratch ☆78 Updated 5 months ago
- Large Language Model (LLM) Systems Paper List ☆1,691 Updated last week
- My learning notes for ML SYS. ☆4,783 Updated this week
- A comprehensive guide for beginners in the field of data management and artificial intelligence. ☆513 Updated 8 months ago
- ☆36 Updated 5 months ago
- The newest solution for CS224n: Stanford NLP. (Assignment code implementation) ☆72 Updated 2 years ago
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/ ☆69 Updated 8 months ago
- Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face" ☆741 Updated 2 months ago
- ☆44 Updated last month
- Solutions for CS224n (2022) ☆72 Updated last year
- This repository collects noteworthy MLSys bloggers (Algorithms/Systems) ☆307 Updated 11 months ago
- ☆1,351 Updated 3 months ago
- LLaMA 2 implemented from scratch in PyTorch ☆363 Updated 2 years ago
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,718 Updated 8 months ago
- Minimalistic 4D-parallelism distributed training framework for educational purposes ☆1,923 Updated 4 months ago
- My Solution and Notes for the Stanford CS336: LLM from scratch ☆73 Updated last week