SijiaCui / play-urtsLinks
☆15Updated last year
Alternatives and similar repositories for play-urts
Users that are interested in play-urts are comparing it to the libraries listed below
Sorting:
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…☆640Updated last month
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆326Updated 2 weeks ago
- A list of awesome papers on LLM tool learning.☆26Updated last year
- Building a comprehensive and handy list of papers for GUI agents☆544Updated 2 weeks ago
- ALFWorld: Aligning Text and Embodied Environments for Interactive Learning☆566Updated 3 months ago
- ☆369Updated 3 weeks ago
- ☆174Updated 10 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆327Updated last year
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆676Updated 9 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆139Updated 2 weeks ago
- ICLR 2025 Agent-Related Papers☆72Updated 11 months ago
- This is the repository for the Tool Learning survey.☆452Updated 3 months ago
- A version of verl to support diverse tool use☆654Updated 2 weeks ago
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆379Updated last year
- ☆307Updated 5 months ago
- GitHub page for "Large Language Model-Brained GUI Agents: A Survey"☆205Updated 4 months ago
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆357Updated last year
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆360Updated last month
- ☆415Updated 3 weeks ago
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆88Updated 6 months ago
- Training VLM agents with multi-turn reinforcement learning☆293Updated 2 weeks ago
- A comprehensive collection of process reward models.☆116Updated last month
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆192Updated 6 months ago
- The related works and background techniques about Openai o1☆223Updated 10 months ago
- ☆228Updated 2 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆241Updated 6 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆38Updated last year
- This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met…☆158Updated last year
- A Survey on Large Language Model-Based Game Agents☆743Updated last month
- ☆548Updated 10 months ago