cmu-l3 / anlp-spring2025-code
Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/
☆66, updated 6 months ago
Alternatives and similar repositories for anlp-spring2025-code
Users who are interested in anlp-spring2025-code are comparing it to the repositories listed below.
- ☆96, updated last year
- ☆373, updated 9 months ago
- Notes and commented code for RLHF (PPO) ☆110, updated last year
- ☆76, updated last year
- minimal GRPO implementation from scratch ☆98, updated 6 months ago
- ☆222, updated this week
- ☆151, updated 10 months ago
- An extension of the nanoGPT repository for training small MOE models. ☆195, updated 6 months ago
- A brief and partial summary of RLHF algorithms. ☆132, updated 7 months ago
- Direct Preference Optimization from scratch in PyTorch ☆113, updated 6 months ago
- Notes on Direct Preference Optimization ☆23, updated last year
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs" ☆535, updated 2 months ago
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO) ☆120, updated 4 months ago
- ☆51, updated 2 months ago
- Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts. ☆134, updated last year
- Minimal hackable GRPO implementation ☆286, updated 8 months ago
- Physics of Language Models, Part 4 ☆247, updated 2 months ago
- ☆186, updated last year
- Tina: Tiny Reasoning Models via LoRA ☆284, updated 2 weeks ago
- ☆437, updated last month
- NeurIPS 2024 tutorial on LLM Inference ☆47, updated 9 months ago
- PyTorch building blocks for the OLMo ecosystem ☆301, updated last week
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example ☆360, updated last week
- Survey of Small Language Models from Penn State, ... ☆202, updated last month
- Distributed training (multi-node) of a Transformer model ☆83, updated last year
- ☆143, updated 6 months ago
- Reproducible, flexible LLM evaluations ☆251, updated 2 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. ☆217, updated 2 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨ ☆258, updated last year
- LLaMA 2 implemented from scratch in PyTorch ☆353, updated 2 years ago