Aaron617 / ICLR-2025-Submissions-AgentLinks
ICLR 2025 Agent-Related Papers
☆74Updated last year
Alternatives and similar repositories for ICLR-2025-Submissions-Agent
Users that are interested in ICLR-2025-Submissions-Agent are comparing it to the libraries listed below
Sorting:
- Training VLM agents with multi-turn reinforcement learning☆342Updated 2 weeks ago
- A comprehensive collection of process reward models.☆127Updated 2 months ago
- ☆187Updated 11 months ago
- Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning"☆50Updated last year
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆227Updated last month
- [ICML 2025] Official Implementation of GLIDER☆70Updated 2 months ago
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks☆11Updated 5 months ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆60Updated last year
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆373Updated 3 weeks ago
- ☆82Updated last year
- Data and Code for Paper IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks☆32Updated 3 weeks ago
- [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning☆191Updated 11 months ago
- Paper collections of the continuous effort start from World Models.☆190Updated last year
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆47Updated 3 weeks ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆238Updated 2 weeks ago
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆77Updated 5 months ago
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"☆48Updated 3 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆245Updated 7 months ago
- LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024)☆82Updated 6 months ago
- [NeurIPS 2025 Spotlight] Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning.☆87Updated last week
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)☆273Updated 9 months ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆398Updated 5 months ago
- Official Repository of "Learning what reinforcement learning can't"☆70Updated 3 weeks ago
- ☆118Updated 8 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆139Updated 6 months ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆292Updated last year
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆144Updated last month
- ☆319Updated 6 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆216Updated last month
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆90Updated 7 months ago