ZJU-REAL / OmniEmbodiedLinks
Benchmarking agent reasoning capabilities in physical interactions, tool usage, and multi-agent coordination.
☆42Updated 3 months ago
Alternatives and similar repositories for OmniEmbodied
Users that are interested in OmniEmbodied are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆45Updated last month
- ☆36Updated last month
- ☆31Updated 3 months ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆37Updated 2 months ago
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).☆47Updated 5 months ago
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆47Updated 3 months ago
- Training VLM agents with multi-turn reinforcement learning☆324Updated 3 weeks ago
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆73Updated 5 months ago
- ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆66Updated 6 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆291Updated last month
- A comprehensive collection of process reward models.☆122Updated last month
- ICLR 2025 Agent-Related Papers☆72Updated last year
- ☆20Updated 3 months ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆51Updated 3 weeks ago
- A Self-Training Framework for Vision-Language Reasoning☆87Updated 10 months ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆90Updated 3 weeks ago
- Official Repository of "Learning what reinforcement learning can't"☆69Updated 2 weeks ago
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks☆11Updated 5 months ago
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆42Updated 2 months ago
- ☆46Updated last month
- ☆63Updated last month
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆43Updated 3 months ago
- Towards a Unified View of Large Language Model Post-Training☆187Updated 2 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆138Updated 6 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆153Updated 11 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆209Updated last month
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆54Updated 2 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆56Updated last year
- ☆168Updated last month
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆166Updated 5 months ago