ShwStone / TRex-PPO
Run TRex with PPO
☆39Updated 8 months ago
Alternatives and similar repositories for TRex-PPO
Users who are interested in TRex-PPO are comparing it to the libraries listed below.
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆192Updated 3 months ago
- siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems☆330Updated this week
- Training VLM agents with multi-turn reinforcement learning☆381Updated last week
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆281Updated 11 months ago
- VLA-Arena is an open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models.☆105Updated 2 weeks ago
- ☆412Updated 11 months ago
- [NeurIPS 2025]⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆264Updated 3 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆405Updated last year
- llm & rl☆268Updated 3 months ago
- A reproduction of open-r1 that runs GRPO training on 0.5B, 1.5B, 3B, and 7B Qwen models and observes some interesting phenomena.☆54Updated 9 months ago
- ☆102Updated last week
- ☆136Updated last year
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks☆186Updated 4 months ago
- Open Platform for Embodied Agents☆339Updated last year
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆147Updated 9 months ago
- ICLR 2025 Agent-Related Papers☆75Updated last year
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]☆199Updated 6 months ago
- A curated list of visual reinforcement learning resources☆462Updated 2 months ago
- ☆118Updated 9 months ago
- ☆489Updated 3 months ago
- modern AI for beginners☆191Updated 4 months ago
- Minimal-cost training of a 0.5B R1-Zero model☆805Updated 8 months ago
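- An efficient, fast arXiv paper crawler that saves information on papers matching a specified time range, topic, and keywords to local storage, and translates their titles and abstracts into Chinese.☆171Updated last year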
- ☆222Updated last month
- An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.☆154Updated 3 weeks ago
- A small open source 3D agent simulator based on LLM.☆69Updated last year
- ☆88Updated last year
- A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue☆314Updated 6 months ago
- The development and future prospects of large multimodal reasoning models.☆579Updated 3 weeks ago
- A Telegram bot to recommend arXiv papers☆302Updated 2 months ago