ShwStone / TRex-PPO
Run TRex with PPO
☆39 Updated 6 months ago
Alternatives and similar repositories for TRex-PPO
Users who are interested in TRex-PPO are comparing it to the libraries listed below.
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination" ☆184 Updated last month
- VLA-Arena is an open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models. ☆70 Updated last week
- llm & rl ☆258 Updated last month
- siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems ☆304 Updated this week
- Training VLM agents with multi-turn reinforcement learning ☆338 Updated last week
- ☆400 Updated 10 months ago
- A reproduction of open-r1 that trains 0.5B, 1.5B, 3B, and 7B qwen models with GRPO and observes some interesting phenomena. ☆52 Updated 7 months ago
- ☆60 Updated 5 months ago
- Cool Papers - Immersive Paper Discovery ☆663 Updated 3 months ago
- A Telegram bot to recommend arXiv papers