stanford-cs336 / spring2025-lectures
☆1,467 Updated this week
Alternatives and similar repositories for spring2025-lectures
Users who are interested in spring2025-lectures are comparing it to the repositories listed below
- Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch ☆790 Updated last month
- My implementation of Stanford CS336 assignments. ☆153 Updated 3 months ago
- My learning notes and code for ML SYS. ☆3,808 Updated this week
- ☆373 Updated 9 months ago
- slime is an LLM post-training framework for RL Scaling. ☆2,091 Updated this week
- A Survey of Reinforcement Learning for Large Reasoning Models ☆1,718 Updated this week
- A comprehensive guide for beginners in the field of data management and artificial intelligence. ☆447 Updated 6 months ago
- Learning material for CMU10-714: Deep Learning System ☆279 Updated last year
- Large Language Model (LLM) Systems Paper List ☆1,532 Updated this week
- Building a MoE large model on your own: a complete walkthrough from pretraining to DPO ☆1,450 Updated last week
- ☆864 Updated last month
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible. ☆2,736 Updated last week
- ☆67 Updated 2 months ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR ☆1,570 Updated 5 months ago
- Awesome Reasoning LLM Tutorial/Survey/Guide ☆2,089 Updated 3 months ago
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems) ☆282 Updated 9 months ago
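- A repository sharing literature about large language models ☆102 Updated 3 months ago
- PyTorch distributed tutorials ☆152 Updated 3 months ago
- EasyOffer (a collection of LLM interview experiences) is a summer-internship offer guide tailored for LLM newcomers. It mainly records common hand-written coding questions from major tech companies, interview experiences, and common open-ended interview questions encountered when preparing for LLM summer internships and autumn recruitment. I am a beginner and still learning; corrections from experienced readers are welcome at any time. I hope everyone gets their desired Of… ☆415 Updated 6 months ago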
- Assignment 1 for Stanford CS336 - Language Modeling From Scratch ☆64 Updated 3 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,596 Updated 5 months ago
- Latest Advances on System-2 Reasoning ☆1,243 Updated 4 months ago
- Material for gpu-mode lectures ☆5,143 Updated 2 weeks ago
- A very simple GRPO implementation for reproducing r1-like LLM thinking. ☆1,365 Updated 2 months ago
- LLM notes, including model inference, transformer model structure, and LLM framework code analysis notes. ☆827 Updated 3 weeks ago
- My notes and assignments from studying CS336 ☆84 Updated last month
- Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face" ☆525 Updated this week
- ☆303 Updated 5 months ago
- modern AI for beginners ☆165 Updated last month
- Minimalistic 4D-parallelism distributed training framework for educational purposes ☆1,846 Updated last month