init0xyz / AdaCQRLinks
Implementation of AdaCQR(COLING 2025)
☆10Updated 5 months ago
Alternatives and similar repositories for AdaCQR
Users that are interested in AdaCQR are comparing it to the libraries listed below
Sorting:
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated 6 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆29Updated 3 weeks ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆8Updated 6 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆78Updated 5 months ago
- ☆19Updated last month
- Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?☆29Updated 3 weeks ago
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'☆27Updated last month
- ☆46Updated 8 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆29Updated 7 months ago
- [ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners☆16Updated last year
- ☆12Updated last year
- ☆17Updated 2 months ago
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆13Updated 7 months ago
- BeHonest: Benchmarking Honesty in Large Language Models☆34Updated 10 months ago
- instruction-following benchmark for large reasoning models☆34Updated last month
- The rule-based evaluation subset and code implementation of Omni-MATH☆22Updated 6 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆18Updated last week
- ☆24Updated 2 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆55Updated 11 months ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆61Updated 5 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆54Updated 7 months ago
- ☆19Updated 6 months ago
- ☆16Updated 7 months ago
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering☆60Updated 6 months ago
- ☆11Updated 7 months ago
- LightThinker: Thinking Step-by-Step Compression☆59Updated 2 months ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆26Updated last week
- [ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"☆25Updated last year
- ☆30Updated 6 months ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆73Updated last month