stanford-cs336 / spring2025-lecturesLinks
☆703Updated last month
Alternatives and similar repositories for spring2025-lectures
Users that are interested in spring2025-lectures are comparing it to the libraries listed below
Sorting:
- Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆348Updated 3 months ago
- ☆316Updated 6 months ago
- My learning notes/codes for ML SYS.☆2,854Updated this week
- 🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"☆720Updated 3 months ago
- A repository sharing the literatures about large language models☆95Updated last week
- slime is a LLM post-training framework aiming for RL Scaling.☆596Updated this week
- Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models☆509Updated 2 weeks ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,421Updated 2 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,023Updated 2 weeks ago
- 📰 Must-read papers and blogs on Speculative Decoding ⚡️☆822Updated 3 weeks ago
- Awesome RL Reasoning Recipes ("Triple R")☆745Updated last month
- Large Language Model (LLM) Systems Paper List☆1,362Updated this week
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/☆58Updated 3 months ago
- Awesome Reasoning LLM Tutorial/Survey/Guide☆1,861Updated this week
- A Telegram bot to recommend arXiv papers☆276Updated 3 months ago
- LLaMA 2 implemented from scratch in PyTorch☆337Updated last year
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆497Updated last week
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆252Updated 6 months ago
- Notes and commented code for RLHF (PPO)☆97Updated last year
- TTRL: Test-Time Reinforcement Learning☆704Updated 2 weeks ago
- An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models☆1,411Updated last week
- Paper list for Efficient Reasoning.☆541Updated 3 weeks ago
- SkyRL: A Modular Full-stack RL Library for LLMs☆574Updated last week
- ☆585Updated 3 months ago
- The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".☆385Updated 3 weeks ago
- Textbook on reinforcement learning from human feedback☆1,083Updated last week
- Survey on LLM Agents (Published on CoLing 2025)☆337Updated 2 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,469Updated 2 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,588Updated last week
- Learning material for CMU10-714: Deep Learning System☆262Updated last year