ShwStone / TRex-PPO
Run TRex with PPO
☆39Updated 2 months ago
Alternatives and similar repositories for TRex-PPO
Users who are interested in TRex-PPO are comparing it to the libraries listed below.
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆143Updated last month
- Open Platform for Embodied Agents☆324Updated 6 months ago
- A reproduction of open-r1 that applies GRPO training to 0.5B, 1.5B, 3B, and 7B qwen models and observes some interesting phenomena.☆36Updated 3 months ago
- ☆55Updated last week
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks☆146Updated last month
- ☆348Updated 5 months ago
- llm & rl☆158Updated this week
- ☆72Updated this week
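- An efficient, fast arXiv paper crawler that downloads information on papers matching a specified time range, subject, and keywords to local storage, and translates their titles and abstracts into Chinese.☆119Updated 10 months ago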
- ☆90Updated 9 months ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆133Updated 3 months ago
- A curated list of visual reinforcement learning resources☆319Updated 3 weeks ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆372Updated 7 months ago
- Cool Papers - Immersive Paper Discovery☆572Updated last month
- ☆186Updated this week
- ⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆174Updated last month
- Awesome RL Reasoning Recipes ("Triple R")☆745Updated last month
- ☆43Updated 3 months ago
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory☆151Updated this week
- A Telegram bot to recommend arXiv papers☆276Updated 3 months ago
- Collect every awesome work about r1!☆395Updated 2 months ago
- GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.☆787Updated this week
- [Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey☆447Updated 6 months ago
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆280Updated 2 weeks ago
- ICLR 2025 Agent-Related Papers☆70Updated 8 months ago
- An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.☆137Updated 3 months ago
- A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue☆274Updated last week
- ☆194Updated 3 months ago
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆269Updated last month
- The development and future prospects of multimodal reasoning models.☆436Updated this week