stanford-cs336 / assignment1-basics
Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆910Updated 2 months ago
Alternatives and similar repositories for assignment1-basics
Users that are interested in assignment1-basics are comparing it to the libraries listed below
- ☆2,053Updated 3 weeks ago
- ☆393Updated 10 months ago
- My implementation of Stanford CS336 assignments.☆200Updated 4 months ago
- Learning material for CMU10-714: Deep Learning System☆283Updated last year
- Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch☆111Updated 3 months ago
- ☆86Updated 3 months ago
- A comprehensive guide for beginners in the field of data management and artificial intelligence.☆468Updated 7 months ago
- My learning notes/codes for ML SYS.☆4,136Updated last week
- A repository sharing literature about large language models☆103Updated 4 months ago
- slime is an LLM post-training framework for RL Scaling.☆2,480Updated this week
- A Survey of Reinforcement Learning for Large Reasoning Models☆2,039Updated last week
- Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆77Updated 4 months ago
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆300Updated 10 months ago
- Solutions for CS224n (2022)☆71Updated last year
- Notes and commented code for RLHF (PPO)☆114Updated last year
- Large Language Model (LLM) Systems Paper List☆1,602Updated this week
- ☆543Updated last week
- 🏆🏆 "Large models": all in one & all from scratch. 🌍🌍 Collect and clean data, train a tokenizer, pretraining, SFT, GRPO!☆47Updated 3 months ago
- ☆216Updated 10 months ago
- The newest solution for CS224n: Stanford NLP. (Assignment code implementations)☆72Updated 2 years ago
- ☆103Updated this week
- All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai☆183Updated last year
- My Solution and Notes for the Stanford CS336: LLM from scratch☆65Updated 2 months ago
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw☆566Updated 11 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,662Updated 6 months ago
- ☆99Updated last year
- PyTorch distributed tutorials☆156Updated 5 months ago
- My solutions and supplemental resources for CS224N in the spring of 2024.☆30Updated last year
- Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2…☆269Updated last year
- ☆71Updated 3 months ago