ShwStone / TRex-PPOLinks
Run TRex with PPO
☆39Updated 7 months ago
Alternatives and similar repositories for TRex-PPO
Users that are interested in TRex-PPO are comparing it to the libraries listed below
Sorting:
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆188Updated 2 months ago
- siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems☆327Updated last week
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆278Updated 10 months ago
- 这是一个open-r1的复现项目,对0.5B、1.5B、3B、7B的qwen模型进行GRPO训练,观察到一些有趣的现象。☆54Updated 8 months ago
- Training VLM agents with multi-turn reinforcement learning☆365Updated last week
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆146Updated 9 months ago
- llm & rl☆266Updated 2 months ago
- 青稞Talk☆184Updated this week
- ☆480Updated 3 months ago
- A Telegram bot to recommend arXiv papers☆298Updated 2 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆404Updated last year
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]☆198Updated 5 months ago
- The development and future prospects of large multimodal reasoning models.☆568Updated 5 months ago
- 📖 This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.☆375Updated this week
- 这是一个高效,快捷的arXiv论文爬虫,它可以将指定时间范围,指定主题,包含指定关键词的论文信息爬取到本地,并且将其中的标题和摘要翻译成中文。☆169Updated last year
- ☆409Updated 11 months ago
- ☆208Updated 2 months ago
- Cool Papers - Immersive Paper Discovery☆683Updated 4 months ago
- An reconstruction of RL Introduction and its course materials for a more efficient entry☆16Updated 7 months ago
- minimal-cost for training 0.5B R1-Zero☆799Updated 7 months ago
- A curated list of visual reinforcement learning resources☆454Updated last month
- ☆118Updated 9 months ago
- This repository provides a comprehensive library for parallel training and LoRA algorithm implementations, supporting multiple parallel s…☆52Updated last month
- ☆104Updated last month
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks☆185Updated 3 months ago
- R1-onevision, a visual language model capable of deep CoT reasoning.☆574Updated 8 months ago
- An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.☆154Updated 3 months ago
- Open Platform for Embodied Agents☆336Updated 11 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆398Updated 3 months ago
- Explore the Multimodal “Aha Moment” on 2B Model☆620Updated 9 months ago