stanford-cs336 / spring2024-lectures

☆334

Alternatives and similar repositories for spring2024-lectures

Users who are interested in spring2024-lectures are comparing it to the repositories listed below.

stanford-cs336 / assignment1-basics
Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆499 Updated last week
stanford-cs336 / spring2025-lectures
☆924 Updated 2 weeks ago
cmu-l3 / anlp-spring2025-code
Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/
☆61 Updated 4 months ago
neubig / minllama-assignment
☆90 Updated 10 months ago
McGill-NLP / nano-aha-moment
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
☆512 Updated 3 weeks ago
huggingface / picotron_tutorial
☆206 Updated 5 months ago
srush / awesome-o1
A bibliography and survey of the papers surrounding o1
☆1,209 Updated 8 months ago
NVIDIA / NeMo-Skills
A project to improve skills of large language models
☆501 Updated this week
NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
☆679 Updated last week
NVIDIA-NeMo / RL
Scalable toolkit for efficient model reinforcement
☆578 Updated this week
hkproj / rlhf-ppo
Notes and commented code for RLHF (PPO)
☆101 Updated last year
sail-sg / understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,055 Updated last week
HazyResearch / aisys-building-blocks
Building blocks for foundation models.
☆525 Updated last year
wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆164 Updated 4 months ago
srush / LLM-Training-Puzzles
What would you do with 1000 H100s...
☆1,079 Updated last year
THUDM / slime
slime is an LLM post-training framework aiming for RL Scaling.
☆1,113 Updated this week
sail-sg / oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆425 Updated last week
rwitten / HighPerfLLMs2024
☆518 Updated last year
open-thought / tiny-grpo
Minimal hackable GRPO implementation
☆274 Updated 6 months ago
project-numina / aimo-progress-prize
☆460 Updated last year
LambdaLabsML / distributed-training-guide
Best practices & guides on how to write distributed pytorch training code
☆463 Updated 5 months ago
yuandong-tian / arXiv_recbot
A Telegram bot to recommend arXiv papers
☆281 Updated 3 months ago
yihedeng9 / rlhf-summary-notes
A brief and partial summary of RLHF algorithms.
☆131 Updated 5 months ago
hkproj / pytorch-llama
LLaMA 2 implemented from scratch in PyTorch
☆343 Updated last year
marin-community / marin
☆347 Updated this week
okhat / blog
☆294 Updated 10 months ago
huggingface / picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
☆1,644 Updated 3 weeks ago
huggingface / search-and-learn
Recipes to scale inference-time compute of open models
☆1,110 Updated 2 months ago
huggingface / Math-Verify
☆870 Updated last month
openpsi-project / ReaLHF
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
☆307 Updated 3 months ago