lezhang7 / RearankLinks
[EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent
☆32Updated 4 months ago
Alternatives and similar repositories for Rearank
Users that are interested in Rearank are comparing it to the libraries listed below
Sorting:
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆58Updated last month
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Updated 5 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Updated 7 months ago
- ☆67Updated 4 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆82Updated 2 months ago
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆40Updated 8 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆98Updated last year
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆60Updated 6 months ago
- Geometric-Mean Policy Optimization☆96Updated last month
- Scaling Preference Data Curation via Human-AI Synergy☆135Updated 6 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 7 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆69Updated 7 months ago
- ☆36Updated 3 months ago
- ☆16Updated last year
- ☆17Updated 5 months ago
- ☆41Updated 4 months ago
- ☆51Updated 8 months ago
- ☆53Updated 10 months ago
- ☆53Updated 10 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last month
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆156Updated 6 months ago
- Code and Data for "FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation" (ACL25)☆27Updated 2 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆56Updated 7 months ago
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository☆83Updated last week
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated 2 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Updated 4 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning☆64Updated 2 months ago
- ☆50Updated 10 months ago
- ☆46Updated 4 months ago