Aaron617 / ICLR-2025-Submissions-Agent
ICLR 2025 Agent-Related Papers
☆70 Updated 7 months ago
Alternatives and similar repositories for ICLR-2025-Submissions-Agent
Users who are interested in ICLR-2025-Submissions-Agent are comparing it to the repositories listed below
- A comprehensive collection of process reward models. ☆92 Updated 2 weeks ago
- ☆44 Updated 3 weeks ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld ☆57 Updated 8 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning" ☆125 Updated last week
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab… ☆36 Updated 3 weeks ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper) ☆265 Updated last week
- ☆77 Updated 10 months ago
- ☆136 Updated 6 months ago
- ☆169 Updated this week
- ☆242 Updated last month
- ☆46 Updated last week
- RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints ☆49 Updated 3 weeks ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference ☆136 Updated last week
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference) ☆144 Updated 7 months ago
- ☆130 Updated 11 months ago
- HAZARD challenge ☆35 Updated last month
- [CVPR2024] This is the official implementation of MP5 ☆102 Updated 11 months ago
- RFTT: Reasoning with Reinforced Functional Token Tuning ☆27 Updated 2 weeks ago
- The code repo of paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Compromising Usa… ☆28 Updated 3 months ago
- Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning" ☆43 Updated 7 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance" ☆240 Updated 3 weeks ago
- Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory ☆233 Updated this week
- A paper list for spatial reasoning ☆94 Updated 2 weeks ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond ☆252 Updated 2 weeks ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning. ☆191 Updated this week
- ☆222 Updated this week
- Towards Large Multimodal Models as Visual Foundation Agents ☆216 Updated 2 months ago
- ⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning. ☆164 Updated 2 weeks ago
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents. ☆135 Updated 2 weeks ago
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral) ☆209 Updated 3 months ago