EvanZhuang / AgenticLULinks
Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).
☆10Updated last month
Alternatives and similar repositories for AgenticLU
Users that are interested in AgenticLU are comparing it to the libraries listed below
Sorting:
- ☆19Updated 7 months ago
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆19Updated last year
- ☆14Updated 9 months ago
- ☆16Updated last year
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆27Updated last month
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆27Updated 3 weeks ago
- Codebase for Instruction Following without Instruction Tuning☆36Updated last year
- ☆44Updated 3 weeks ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated 10 months ago
- ☆45Updated last month
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆31Updated 2 months ago
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆34Updated 4 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last month
- ☆50Updated 8 months ago
- ☆38Updated 2 months ago
- ☆17Updated 3 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆42Updated 8 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆18Updated 6 months ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆77Updated last month
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆32Updated last month
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆39Updated last year
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Updated last year
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆37Updated 2 months ago
- ☆50Updated 4 months ago
- ☆92Updated 11 months ago
- ☆19Updated 4 months ago
- ☆24Updated last year
- [EMNLP'2025 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆66Updated 6 months ago
- ☆18Updated 2 weeks ago
- ☆23Updated last year