(ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning
☆19Nov 22, 2025Updated 4 months ago
Alternatives and similar repositories for AgentRefine
Users that are interested in AgentRefine are comparing it to the libraries listed below.
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆16Oct 20, 2025Updated 5 months ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"☆41Jun 24, 2025Updated 8 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆62Jul 4, 2025Updated 8 months ago
- TurboFuzzLLM: Turbocharging Mutation-based Fuzzing for Effectively Jailbreaking Large Language Models in Practice☆22Nov 24, 2025Updated 3 months ago
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆55Aug 28, 2025Updated 6 months ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆42Aug 25, 2025Updated 6 months ago
- ☆67Aug 14, 2025Updated 7 months ago
- ☆16Sep 17, 2024Updated last year
- Toolathlon-Gym for testing AI agents' real-world tool-use capabilities across diverse MCP servers.☆87Updated this week
- [CVPR 2025 Highlight] InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment☆44Jun 29, 2025Updated 8 months ago
- SimX-OR: Extending Any Simulation Benchmark to Evaluate the Observational Robustness of VLA Models☆32Nov 4, 2025Updated 4 months ago
- ☆58Feb 27, 2025Updated last year
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent☆34Aug 20, 2025Updated 7 months ago
- [AAAI-26] Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?☆26Dec 14, 2025Updated 3 months ago
- Official implementation of "Auto-Regressively Generating Multi-View Consistent Images". (ICCV 2025)☆85Jul 26, 2025Updated 7 months ago
- [AAAI 2026 Oral] The official code of "UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning"☆68Dec 8, 2025Updated 3 months ago
- Official code for DeepSound-V1☆13May 14, 2025Updated 10 months ago
- ☆13Nov 23, 2022Updated 3 years ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆22Jun 26, 2025Updated 8 months ago
- ☆41Updated this week
- ☆33May 27, 2025Updated 9 months ago
- [MICCAI 2025] GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Images☆15Mar 12, 2026Updated last week
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- ☆17May 31, 2023Updated 2 years ago
- The official repo for the DanQing dataset.☆32Jan 16, 2026Updated 2 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆713Oct 15, 2025Updated 5 months ago
- ☆120May 26, 2025Updated 9 months ago
- [Nature Communications] 🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal C…☆23Mar 8, 2026Updated 2 weeks ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 5 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆69May 13, 2025Updated 10 months ago
- The Source Code for OmniVideoBench @ICLR 2026☆64Feb 12, 2026Updated last month
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆42May 3, 2025Updated 10 months ago
- ICDE 2024 Paper, MetaSQL: A Generate-then-Rank Framework for Natural Language to SQL Translation☆26May 9, 2025Updated 10 months ago
- BPfold: Deep generalizable prediction of RNA secondary structure via base pair motif energy.☆31Feb 24, 2026Updated 3 weeks ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- 🏆 Official implementation of LangCoop: Collaborative Driving with Natural Language☆78Sep 12, 2025Updated 6 months ago
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆91Nov 15, 2024Updated last year
- Pathology Foundation Models Meet Semantic Segmentation☆30Feb 9, 2026Updated last month
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches☆37Oct 9, 2025Updated 5 months ago