ShwStone / TRex-PPOLinks
Run TRex with PPO
☆39Updated 4 months ago
Alternatives and similar repositories for TRex-PPO
Users that are interested in TRex-PPO are comparing it to the libraries listed below
Sorting:
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆168Updated 3 months ago
- Official Repository for PosterGen☆149Updated 3 weeks ago
- ☆384Updated 8 months ago
- ☆408Updated last month
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]☆189Updated 2 months ago
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks☆172Updated 2 weeks ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆143Updated 6 months ago
- llm & rl☆222Updated 3 weeks ago
- ☆220Updated last week
- ICLR 2025 Agent-Related Papers☆74Updated 10 months ago
- ☆41Updated 4 months ago
- [NeurIPS 2025]⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆210Updated this week
- An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.☆145Updated this week
- 这是一个高效,快捷的arXiv论文爬虫,它可以将指定时间范围,指定主题,包含指定关键词的论文信息爬取到本地,并且将其中的标题和摘要翻译成中文。☆138Updated last year
- Awesome_CV的中文版本,clone本项目到overleaf即可轻松愉快编写自己的CV☆12Updated last year
- A Telegram bot to recommend arXiv papers☆282Updated 5 months ago
- Open Platform for Embodied Agents☆330Updated 8 months ago
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆256Updated 7 months ago
- ☆59Updated 3 months ago
- 📖 This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.☆285Updated last week
- RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforce…☆459Updated last week
- ☆51Updated 6 months ago
- ☆30Updated 11 months ago
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆61Updated last year
- MLLM @ Game☆14Updated 4 months ago
- The development and future prospects of multimodal reasoning models.☆508Updated 2 months ago
- A curated list of visual reinforcement learning resources☆404Updated 2 weeks ago
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆364Updated 3 months ago
- ☆160Updated 3 weeks ago
- 这是一个open-r1的复现项目,对0.5B、1.5B、3B、7B的qwen模型进行GRPO训练,观察到一些有趣的现象。☆46Updated 5 months ago