cmu-l3 / anlp-fall2025-codeLinks
Advanced NLP, Fall 2025 https://cmu-l3.github.io/anlp-fall2025/
☆46Updated 2 weeks ago
Alternatives and similar repositories for anlp-fall2025-code
Users that are interested in anlp-fall2025-code are comparing it to the libraries listed below
Sorting:
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆88Updated 10 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆307Updated last month
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆127Updated 3 months ago
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective☆226Updated this week
- ☆401Updated last month
- Official implementation of GRAPE: Group Representational Position Encoding (https://arxiv.org/abs/2512.07805)☆74Updated 3 weeks ago
- ☆465Updated 5 months ago
- Compiling useful links, papers, benchmarks, ideas, etc.☆46Updated 10 months ago
- rl from zero pretrain, can it be done? yes.☆286Updated 4 months ago
- Universal Reasoning Model☆121Updated 2 weeks ago
- Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch☆162Updated 6 months ago
- SSRL: Self-Search Reinforcement Learning☆205Updated 5 months ago
- ☆100Updated 6 months ago
- Memory optimized Mixture of Experts☆72Updated 6 months ago
- Open source interpretability artefacts for R1.☆169Updated 9 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆89Updated last week
- [Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments☆175Updated 2 weeks ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆358Updated 7 months ago
- An extension of the nanoGPT repository for training small MOE models.☆231Updated 10 months ago
- ☆42Updated last year
- Evaluation of LLMs on latest math competitions☆213Updated last month
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/☆71Updated 10 months ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆57Updated this week
- ☆45Updated 8 months ago
- ☆230Updated 2 months ago
- ☆346Updated this week
- Storing long contexts in tiny caches with self-study☆231Updated last month
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆102Updated 5 months ago
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset☆31Updated 9 months ago
- qwen3 experiments☆34Updated 7 months ago