stanford-cs336 / spring2025-lecturesLinks
☆1,072Updated last month
Alternatives and similar repositories for spring2025-lectures
Users that are interested in spring2025-lectures are comparing it to the libraries listed below
Sorting:
- Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆581Updated last month
- ☆349Updated 8 months ago
- My learning notes/codes for ML SYS.☆3,372Updated this week
- My implementation of Stanford CS336 assignments.☆116Updated last month
- slime is a LLM post-training framework aiming for RL Scaling.☆1,420Updated this week
- Large Language Model (LLM) Systems Paper List☆1,458Updated last week
- An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models☆1,755Updated last week
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,518Updated 3 months ago
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆269Updated 7 months ago
- A repository sharing the literatures about large language models☆100Updated last month
- Materials for learning SGLang☆549Updated this week
- 🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"☆822Updated 5 months ago
- Awesome Reasoning LLM Tutorial/Survey/Guide☆2,011Updated last month
- Learning material for CMU10-714: Deep Learning System☆270Updated last year
- Distributed RL System for LLM Reasoning☆2,393Updated this week
- This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov☆1,893Updated 3 months ago
- SkyRL: A Modular Full-stack RL Library for LLMs☆765Updated this week
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,236Updated 2 weeks ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,693Updated last month
- LLaMA 2 implemented from scratch in PyTorch☆347Updated last year
- pytorch distribute tutorials☆147Updated 2 months ago
- Building DeepSeek R1 from Scratch☆684Updated 5 months ago
- ☆272Updated 3 months ago
- ☆405Updated this week
- A very simple GRPO implement for reproducing r1-like LLM thinking.☆1,290Updated 3 weeks ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,537Updated 4 months ago
- Textbook on reinforcement learning from human feedback☆1,185Updated last week
- A comprehensive guide for beginners in the field of data management and artificial intelligence.☆400Updated 4 months ago
- An ML Systems Onboarding list☆877Updated 7 months ago
- All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai☆177Updated last year