OceanGPT / OceanGymLinks
OceanGym: A Benchmark Environment for Underwater Embodied Agents
☆85Updated 2 weeks ago
Alternatives and similar repositories for OceanGym
Users that are interested in OceanGym are comparing it to the libraries listed below
Sorting:
- ICLR 2025 Agent-Related Papers☆75Updated last year
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"☆62Updated last month
- ☆270Updated 5 months ago
- ☆198Updated last year
- A comprehensive collection of process reward models.☆134Updated 3 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆290Updated 2 months ago
- A paper list of Awesome Latent Space.☆305Updated last week
- Monitor Google Scholar author citation counts and track changes automatically without opening tabs.☆68Updated 5 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆310Updated 3 weeks ago
- ☆303Updated 6 months ago
- Training VLM agents with multi-turn reinforcement learning☆381Updated this week
- A Collection of Papers about Memory for Language Agents☆289Updated last week
- A curated list of personalized alignment resources (continually updated).☆56Updated 3 months ago
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks☆11Updated 7 months ago
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆83Updated last month
- ☆213Updated 6 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆271Updated 3 weeks ago
- [AAAI 2026] Data and Code for Paper IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks☆39Updated 2 months ago
- CycleResearcher: Improving Automated Research via Automated Review☆325Updated 6 months ago
- ✨✨Latest Advances on Neuro-Symbolic Learning in the era of Large Language Models☆253Updated 7 months ago
- 学术双语简历模板,涵盖教育背景、论文发表、项目经历、竞赛经历和个人陈述等关键部分,可适用于申请研究生项目、学术职位或相关行业岗位。☆172Updated 7 months ago
- [NeurIPS 2025]⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆264Updated 3 months ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆411Updated 6 months ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆62Updated last year
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆45Updated 3 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆404Updated 3 months ago
- ☆137Updated last month
- SePer is an accurate / fast / free-of-API metric to measure document quality via information gain☆29Updated 4 months ago
- Latest Advances on Long Chain-of-Thought Reasoning☆601Updated 6 months ago
- ☆83Updated last year