ShwStone / TRex-PPOLinks
Run TRex with PPO
☆39Updated 3 weeks ago
Alternatives and similar repositories for TRex-PPO
Users that are interested in TRex-PPO are comparing it to the libraries listed below
Sorting:
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆121Updated this week
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks☆122Updated last week
- ⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆157Updated 2 weeks ago
- 这是一个高效,快捷的arXiv论文爬虫,它可以将指定时间范围,指定主题,包含指定关键词的论文信息爬取到本地,并且将其中的标题和摘要翻译成中文。☆107Updated 8 months ago
- ICLR 2025 Agent-Related Papers☆71Updated 6 months ago
- ☆78Updated 8 months ago
- ☆151Updated last week
- ☆38Updated 3 weeks ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆129Updated last month
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆82Updated 2 months ago
- ☆216Updated 2 weeks ago
- ☆329Updated 3 months ago
- A curated list of visual reinforcement learning resources☆282Updated 2 weeks ago
- llm & rl☆139Updated this week
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆212Updated this week
- 在没有sudo权限的情况下,在linux上使用clash☆106Updated 6 months ago
- Open Platform for Embodied Agents☆318Updated 4 months ago
- Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning☆192Updated last week
- https://hnlp.boyuai.com☆93Updated 8 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆266Updated last week
- ☆269Updated last week
- A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue☆264Updated last month
- An open-source lightweight game generation paradigm. It includes everything from data processing to model architecture design and playabi…☆88Updated 5 months ago
- ☆189Updated last month
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆134Updated this week
- A Telegram bot to recommend arXiv papers☆272Updated last month
- A small open source 3D agent simulator based on LLM.☆65Updated 6 months ago
- ✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning☆136Updated 3 weeks ago
- 这是一个open-r1的复现项目,对0.5B、1.5B、3B、7B的qwen模型进行GRPO训练,观察到一些有趣的现象。☆30Updated last month
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆203Updated 3 months ago