ShwStone / TRex-PPOLinks
Run TRex with PPO
☆39Updated last month
Alternatives and similar repositories for TRex-PPO
Users that are interested in TRex-PPO are comparing it to the libraries listed below
Sorting:
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆130Updated 2 weeks ago
- ☆15Updated 7 months ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆131Updated 2 months ago
- ☆172Updated this week
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks☆134Updated 3 weeks ago
- A small open source 3D agent simulator based on LLM.☆66Updated 6 months ago
- ⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆164Updated 2 weeks ago
- ☆82Updated 8 months ago
- llm & rl☆151Updated this week
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆216Updated 4 months ago
- ☆191Updated 2 months ago
- ☆40Updated last week
- 这是一个高效,快捷的arXiv论文爬虫,它可以将指定时间范围,指定主题,包含指定关键词的论文信息爬取到本地,并且将其中的标题和摘要翻译成中文。☆116Updated 9 months ago
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆246Updated this week
- A Telegram bot to recommend arXiv papers☆275Updated 2 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆186Updated 3 months ago
- ☆337Updated 4 months ago
- ☆547Updated this week
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆240Updated 3 weeks ago
- ☆106Updated 2 months ago
- A curated list of visual reinforcement learning resources☆306Updated this week
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆75Updated 3 weeks ago
- An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.☆130Updated 2 months ago
- ☆242Updated last month
- 解锁HuggingFace生态的百般用法☆91Updated 6 months ago
- https://hnlp.boyuai.com☆93Updated 8 months ago
- A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue☆268Updated this week
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆84Updated 3 months ago
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆169Updated last year
- Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning☆206Updated this week