GX-XinGao / GRALinks
The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"
☆34Updated 3 months ago
Alternatives and similar repositories for GRA
Users that are interested in GRA are comparing it to the libraries listed below
Sorting:
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆64Updated 4 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆76Updated 2 weeks ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆30Updated last month
- ☆10Updated 5 months ago
- ☆89Updated 10 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆40Updated 6 months ago
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆45Updated 7 months ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆68Updated last week
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆90Updated last month
- ☆13Updated 7 months ago
- ☆21Updated 5 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆39Updated 11 months ago
- ☆30Updated 2 months ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆51Updated 9 months ago
- ☆42Updated last month
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents☆42Updated 2 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression☆98Updated 5 months ago
- ☆45Updated last week
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆57Updated 2 months ago
- ☆59Updated 10 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆22Updated last month
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33Updated last year
- ☆49Updated 3 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆111Updated 4 months ago
- The paper list of multilingual pre-trained models (Continual Updated).☆23Updated last year
- SSRL: Self-Search Reinforcement Learning☆131Updated 3 weeks ago
- ☆89Updated 4 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated 9 months ago
- ☆92Updated 3 weeks ago
- JudgeLRM: Large Reasoning Models as a Judge☆36Updated 5 months ago