stanford-cs336 / spring2025-lecturesLinks
☆2,101Updated 3 weeks ago
Alternatives and similar repositories for spring2025-lectures
Users that are interested in spring2025-lectures are comparing it to the libraries listed below
Sorting:
- Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆933Updated 2 months ago
- My learning notes/codes for ML SYS.☆4,201Updated last week
- My implementation of Stanford CS336 assignments.☆200Updated 4 months ago
- Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"☆560Updated last month
- A Survey of Reinforcement Learning for Large Reasoning Models☆2,069Updated 2 weeks ago
- 个人构建MoE大模型:从预训练到DPO的完整实践☆1,852Updated 2 weeks ago
- ☆86Updated 4 months ago
- ☆399Updated 10 months ago
- A comprehensive guide for beginners in the field of data management and artificial intelligence.☆474Updated 7 months ago
- Awesome Reasoning LLM Tutorial/Survey/Guide☆2,151Updated last month
- 《EasyOffer》(<大模型面经合集>)是针对LLM宝宝们量身打造的大模型暑期实习Offer指南,主要记录大模型暑期实习和秋招准备的一些常见大厂手撕代码、大厂面经经验、常见大厂思考题等;小白一个,正在学习ing......有问题各位大佬随时指正,希望大家都能拿到心仪Of…☆566Updated 7 months ago
- slime is an LLM post-training framework for RL Scaling.☆2,543Updated this week
- ☆1,204Updated 2 months ago
- ☆75Updated last month
- LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.☆842Updated 2 months ago
- LLM大模型(重点)以及搜广推等 AI 算法中手写的面试题,(非 LeetCode),比如 Self-Attention, AUC等,一般比 LeetCode 更考察一个人的综合能力 ,又更贴近业务和基础知识一点☆439Updated 10 months ago
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.☆3,015Updated this week
- 中文翻译的 Hands-On-Large-Language-Models (hands-on-llms),动手学习大模型☆1,705Updated last month
- 🏆🏆 「大模型」All in one & All from scratch. 🌍🌍 收集、清洗数据,训练Tokenizer,预训练、SFT、GRPO!☆48Updated 3 months ago
- Large Language Model (LLM) Systems Paper List☆1,628Updated this week
- A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.☆3,421Updated 3 weeks ago
- Building DeepSeek R1 from Scratch☆717Updated 8 months ago
- ☆43Updated 3 weeks ago
- Learning material for CMU10-714: Deep Learning System☆283Updated last year
- Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆77Updated 4 months ago
- ☆325Updated 6 months ago
- 从无名小卒到大模型(LLM)大英雄~ 欢迎关注后续!!!☆1,844Updated this week
- modern AI for beginners☆178Updated 2 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,892Updated 2 months ago
- Textbook on reinforcement learning from human feedback☆1,329Updated this week