ShwStone / TRex-PPO
Run TRex with PPO
☆39Updated 6 months ago
Alternatives and similar repositories for TRex-PPO
Users that are interested in TRex-PPO are comparing it to the libraries listed below
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆181Updated last month
- ☆125Updated last year
- llm & rl☆246Updated 3 weeks ago
- A reproduction of open-r1: GRPO training on 0.5B, 1.5B, 3B, and 7B Qwen models, with some interesting phenomena observed.☆51Updated 7 months ago
- Training VLM agents with multi-turn reinforcement learning☆304Updated last week
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆265Updated 9 months ago
- ☆396Updated 9 months ago
- A Telegram bot to recommend arXiv papers☆287Updated last week
- A Chinese version of Awesome_CV; clone this project to Overleaf to write your own CV easily.☆14Updated last year
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks☆179Updated last month
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆398Updated 11 months ago
- [NeurIPS 2025]⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆232Updated last month
- A curated list of visual reinforcement learning resources☆433Updated last month
- 青稞Talk☆161Updated last week
- Qwen2.5 0.5B GRPO☆71Updated 9 months ago
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]☆196Updated 4 months ago
- modern AI for beginners☆174Updated 2 months ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆145Updated 7 months ago
- VLA-Arena is an open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models.☆41Updated last month
- ☆423Updated last month
- 📖 This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.☆332Updated last week
- Open Platform for Embodied Agents☆333Updated 10 months ago
- A reconstruction of RL Introduction and its course materials for a more efficient entry☆16Updated 5 months ago
- An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.☆146Updated last month
- The development and future prospects of large multimodal reasoning models.☆545Updated 3 months ago
- ☆52Updated last year
- Cool Papers - Immersive Paper Discovery☆648Updated 2 months ago
- Official Repository for PosterGen☆182Updated last month
- minimal-cost for training 0.5B R1-Zero☆785Updated 6 months ago
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆397Updated last month