balrog-ai / BALROGLinks
Benchmarking Agentic LLM and VLM Reasoning On Games
☆228Updated this week
Alternatives and similar repositories for BALROG
Users that are interested in BALROG are comparing it to the libraries listed below
Sorting:
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆185Updated 8 months ago
- ☆110Updated last year
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆350Updated last week
- Repository for the paper Stream of Search: Learning to Search in Language☆153Updated last year
- A Gym for Agentic LLMs