stanford-cs336 / spring2025-lecturesLinks
☆924Updated 2 weeks ago
Alternatives and similar repositories for spring2025-lectures
Users that are interested in spring2025-lectures are comparing it to the libraries listed below
Sorting:
- Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆499Updated last week
- ☆334Updated 7 months ago
- My learning notes/codes for ML SYS.☆3,153Updated this week
- slime is a LLM post-training framework aiming for RL Scaling.☆1,113Updated this week
- Awesome Reasoning LLM Tutorial/Survey/Guide☆1,929Updated 3 weeks ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,479Updated 2 months ago
- My implementation of Stanford CS336 assignments.☆77Updated 3 weeks ago
- Large Language Model (LLM) Systems Paper List☆1,409Updated this week
- Distributed RL System for LLM Reasoning☆2,135Updated this week
- An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models☆1,605Updated this week
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,644Updated 3 weeks ago
- Textbook on reinforcement learning from human feedback☆1,147Updated 2 weeks ago
- Awesome RL Reasoning Recipes ("Triple R")☆762Updated last month
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,055Updated last week
- 📰 Must-read papers and blogs on Speculative Decoding ⚡️☆854Updated this week
- A very simple GRPO implement for reproducing r1-like LLM thinking.☆1,231Updated 2 weeks ago
- 🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"☆778Updated 4 months ago
- Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models☆547Updated this week
- Learning material for CMU10-714: Deep Learning System☆264Updated last year
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆265Updated 7 months ago
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/☆61Updated 4 months ago
- Latest Advances on System-2 Reasoning☆1,207Updated last month
- This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov☆1,874Updated 2 months ago
- O1 Replication Journey☆1,998Updated 6 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,182Updated last week
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,508Updated 3 months ago
- pytorch distribute tutorials☆143Updated last month
- Materials for learning SGLang☆515Updated 2 weeks ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆512Updated 3 weeks ago
- A repository sharing the literatures about large language models☆98Updated last month