EvanZhuang / AgenticLULinks
Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).
☆12Updated 4 months ago
Alternatives and similar repositories for AgenticLU
Users that are interested in AgenticLU are comparing it to the libraries listed below
Sorting:
- ☆19Updated 10 months ago
- Codebase for Instruction Following without Instruction Tuning☆36Updated last year
- ☆50Updated 11 months ago
- ☆23Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Updated 3 months ago
- ☆67Updated 10 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Updated 5 months ago
- ☆16Updated last year
- ☆46Updated 7 months ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Updated 3 months ago
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Updated last year
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Updated 5 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated last year
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆22Updated 10 months ago
- ☆14Updated last year
- ☆47Updated 3 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated last year
- 🌟Official code of our AAAI26 paper 🔍WebFilter☆33Updated 2 months ago
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆21Updated last year
- Sotopia-RL: Reward Design for Social Intelligence☆46Updated this week
- ☆95Updated last year
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆46Updated 5 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last month
- ☆59Updated 2 weeks ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Updated 7 months ago
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'☆27Updated 8 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆40Updated last year
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆17Updated 3 months ago
- ☆23Updated 6 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆19Updated 9 months ago