GX-XinGao / GRA
The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"
☆22, updated last month
Alternatives and similar repositories for GRA
Users who are interested in GRA compare it to the repositories listed below.
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning? (☆25, updated 2 months ago)
- MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion (ACL 2025) (☆22, updated last week)
- The official Github repository for paper "R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation" (EMNLP 2024 Fin… (☆33, updated 6 months ago)
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization (☆36, updated 3 months ago)
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents (☆30, updated 2 weeks ago)
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew… (☆38, updated 3 months ago)
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… (☆48, updated 11 months ago)
- Code for paper: Long cOntext aliGnment via efficient preference Optimization (☆14, updated 3 months ago)
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning". (By Xingh… (☆15, updated 3 months ago)
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting (☆32, updated last year)
- L-CITEEVAL: DO LONG-CONTEXT MODELS TRULY LEVERAGE CONTEXT FOR RESPONDING? (☆23, updated 7 months ago)
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024] (☆58, updated 4 months ago)
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning (☆98, updated last month)
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models (☆36, updated 2 weeks ago)
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning (☆21, updated last week)
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling' (☆21, updated 3 weeks ago)
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent (☆57, updated 3 weeks ago)
- The code and data of DPA-RAG, accepted by WWW 2025 main conference. (☆61, updated 4 months ago)
- [ACL 2025] Knowledge Unlearning for Large Language Models (☆32, updated last month)
- RuleR: Improving LLM Controllability by Rule-based Data Recycling (☆12, updated last month)
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet… (☆18, updated 9 months ago)
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs (☆25, updated 2 weeks ago)