cmu-l3 / anlp-fall2025-codeLinks
Advanced NLP, Fall 2025 https://cmu-l3.github.io/anlp-fall2025/
☆49Updated 3 weeks ago
Alternatives and similar repositories for anlp-fall2025-code
Users that are interested in anlp-fall2025-code are comparing it to the libraries listed below
Sorting:
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆90Updated 10 months ago
- ☆105Updated 6 months ago
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective☆231Updated 2 weeks ago
- SSRL: Self-Search Reinforcement Learning☆206Updated 5 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆128Updated 4 months ago
- rl from zero pretrain, can it be done? yes.☆286Updated 4 months ago
- ☆466Updated 5 months ago
- ☆394Updated last week
- Lightly-reviewed collection of community environments☆210Updated 2 weeks ago
- Simple & Scalable Pretraining for Neural Architecture Research☆307Updated 2 months ago
- Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch☆164Updated 6 months ago
- Open source interpretability artefacts for R1.☆170Updated 9 months ago
- Universal Reasoning Model☆122Updated 3 weeks ago
- ☆44Updated 6 months ago
- [Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments☆177Updated 3 weeks ago
- ☆191Updated 2 weeks ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆110Updated 11 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆175Updated 4 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆593Updated 4 months ago
- Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University☆297Updated this week
- qwen3 experiments☆34Updated 7 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆358Updated 7 months ago
- minimal GRPO implementation from scratch☆102Updated 10 months ago
- When Reasoning Meets Its Laws☆35Updated last month
- Compiling useful links, papers, benchmarks, ideas, etc.☆46Updated 10 months ago
- ☆232Updated 2 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆89Updated 2 weeks ago
- Exploring Applications of GRPO☆251Updated 5 months ago
- Open-source release accompanying Gao et al. 2025☆501Updated 2 months ago
- ☆18Updated 7 months ago