YYZhang2025/Stanford-CS336

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YYZhang2025/Stanford-CS336)

YYZhang2025 / Stanford-CS336

My Solution and Notes for the Stanford CS336: LLM from scratch

☆264

Alternatives and similar repositories for Stanford-CS336

Users that are interested in Stanford-CS336 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Louisym / Stanford-CS336-spring25
View on GitHub
My implementations on the 5 assignments of cs336
☆318Nov 23, 2025Updated 8 months ago
wingAGI / cs336-assignments-answer
View on GitHub
My implementation of Stanford CS336 assignments.
☆246Mar 15, 2026Updated 4 months ago
SingularGuyLeBorn / Awesome-CS336-NoteForEveryone
View on GitHub
☆143Jan 18, 2026Updated 6 months ago
heng380 / cs336_assignment2
View on GitHub
CS33作业 2 的代码和飞书 qa, 这个作业太恶心了, 绝对是所有作业里面花的最久的
☆24Jul 17, 2025Updated last year
heng380 / cs336_assignment-5
View on GitHub
CS336 作业 5 实现, 附加作业里面的 dpo/rlhf 也完成了, 消融实验分析也放在飞书文档里面了, 仅供参考
☆39Sep 27, 2025Updated 9 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
flwfdd / cs336-assignment1-basics
View on GitHub
（包含完整代码和坑点记录）Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆41Jan 22, 2026Updated 6 months ago
wingAGI / clean-llm
View on GitHub
🏆🏆 「大模型」All in one & All from scratch. 🌍🌍 收集、清洗数据，训练Tokenizer，预训练、SFT、GRPO！
☆56Aug 12, 2025Updated 11 months ago
heng380 / cs336-assignment1
View on GitHub
cs336作业 1 实现, 我把 qa 问题也放在飞书链接里面了, 仅供参考
☆35Jul 3, 2025Updated last year
stanford-cs336 / lectures
View on GitHub
☆3,530May 28, 2026Updated last month
stanford-cs336 / assignment5-alignment
View on GitHub
☆187Jun 4, 2026Updated last month
Spectual / stanford-cs336-a1
View on GitHub
Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆77Jul 7, 2025Updated last year
datawhalechina / diy-llm
View on GitHub
🎓 系统性大语言模型构建课程｜🛠️ 覆盖预训练数据工程、Tokenizer、Transformer、MoE、GPU 编程 (CUDA/Triton)、分布式训练、Scaling Laws、推理优化及对齐 (SFT/RLHF/GRPO)｜🚀 6 个渐进式作业 + 代码驱…
☆1,073Updated this week
thepowerfuldeez / cs336_solutions
View on GitHub
Here's my solutions to all assignments of Stanford CS336 course: LLM from Scratch
☆29Sep 26, 2025Updated 10 months ago
mocibb / cs336
View on GitHub
☆95Jul 20, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
Zian-2 / cs336_assignments_and_notes
View on GitHub
cs336（2025-Spring）的作业参考和较详尽的中文笔记
☆33Mar 13, 2026Updated 4 months ago
yzhbradoodrrpurp / EECS498
View on GitHub
Collection of assignments and resources from Umich EECS498 Fall 2019.
☆20Aug 25, 2025Updated 11 months ago
TeenLucifer / grpo_reproduce
View on GitHub
A comparison of deepseek grpo and qwen gspo on Qwen2.5-1.5B-Instruct fine tunning.
☆171Mar 28, 2026Updated 3 months ago
jshn9515 / deep-learning-notes
View on GitHub
Personal deep learning study notes and tutorial-style notebooks
☆564Updated this week
Sherlock1956 / TransformerFromScratch
View on GitHub
☆55Nov 22, 2025Updated 8 months ago
WangYuHang-cmd / CS336
View on GitHub
A private repo for learning CS336
☆38Sep 2, 2025Updated 10 months ago
stanford-cs336 / assignment2-systems
View on GitHub
Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch
☆282May 1, 2026Updated 2 months ago
donglinkang2021 / cs336-assignment1-basics
View on GitHub
Implementation of my CS336 assignment1
☆47Dec 23, 2025Updated 7 months ago
ml-researcher / VAE
View on GitHub
☆11Oct 8, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
abdelfattah-lab / SplitReason
View on GitHub
☆20Mar 18, 2026Updated 4 months ago
ckd0817 / LLM-Interview-Code
View on GitHub
☆736Mar 26, 2026Updated 4 months ago
jingyaogong / minimind
View on GitHub
🧠「大模型」2小时完全从0训练64M的小参数LLM！Train a 64M-parameter LLM from scratch in just 2h!
☆53,838Updated this week
wind-wing234 / cs336-assignment1-basics
View on GitHub
Stanford "Language Modeling from Scratch" CS336 Assignment1 - 斯坦福大学 CS336 课程作业1 个人实现，仅供参考
☆45Jun 15, 2025Updated last year
Mionger / MIPS-CPU
View on GitHub
Project of hardware course group in Tongji University
☆15Dec 26, 2019Updated 6 years ago
Ashside / LLM-HandCoding-Interview
View on GitHub
收集为大模型面试准备的手撕代码
☆47Mar 15, 2026Updated 4 months ago
thomaschlt / mla.c
View on GitHub
Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper.
☆18Jan 15, 2025Updated last year
datawhalechina / happy-llm
View on GitHub
📚 从零开始构建大模型
☆32,338May 6, 2026Updated 2 months ago
czbnlp / cs336-5
View on GitHub
☆24Feb 27, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
WKQ9411 / Mini-LLM
View on GitHub
This project aims to replicate mainstream open-source model architectures with limited computational resources, implementing mini models …
☆284Jun 14, 2026Updated last month
EverMind-AI / EverMe
View on GitHub
Open-source CLI and Agent plugin suite for EverMe — cross-device, cross-Agent personal memory for AI Agents.
☆40Updated this week
salmon1802 / O_o
View on GitHub
TAAC2025初赛第十四名O_o队伍代码
☆136Oct 27, 2025Updated 8 months ago
gouzigouzi / attention-residuals-for-chinese-llms
View on GitHub
A Chinese-focused PyTorch framework for exploring Attention Residuals in Qwen3-style causal LMs, with baseline, Block AttnRes, Full AttnR…
☆19May 3, 2026Updated 2 months ago
AsyncOS / AsyncOS.github.io
View on GitHub
☆12Updated this week
yuandaxia2001 / HealthAI-2025
View on GitHub
☆170Mar 18, 2026Updated 4 months ago
Tongyun1 / from-minimind-to-more
View on GitHub
🎓从0开始训练一个大模型Minimind项目的超详细解析，包括但不限于用到的架构，算法，以及大模型面试经验
☆1,010May 25, 2026Updated 2 months ago