[ICLR 2026] LLM/VLM gaming agents and model evaluation through games.
☆891Nov 16, 2025Updated 4 months ago
Alternatives and similar repositories for GamingAgent
Users that are interested in GamingAgent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Resources for the Enigmata Project.☆81Aug 13, 2025Updated 7 months ago
- ☆59May 21, 2025Updated 10 months ago
- A Survey on Large Language Model-Based Game Agents☆852Feb 13, 2026Updated last month
- Benchmark environment for evaluating vision-language models (VLMs) on popular video games!☆337May 30, 2025Updated 9 months ago
- Efficient Long-context Language Model Training by Core Attention Disaggregation☆98Mar 5, 2026Updated 2 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆359Mar 18, 2026Updated last week
- LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations☆26May 21, 2025Updated 10 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,553Mar 15, 2026Updated last week
- Benchmarking Agentic LLM and VLM Reasoning On Games☆240Updated this week
- ☆66Feb 4, 2026Updated last month
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning☆60Dec 18, 2025Updated 3 months ago
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)☆9,231Updated this week
- Democratizing Reinforcement Learning for LLMs☆5,259Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMs☆20,097Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆62Aug 30, 2024Updated last year
- Verlog: A Multi-turn RL framework for LLM agents☆71Mar 11, 2026Updated 2 weeks ago
- Official repository of the NeurIPS 2025 Competition: The PokeAgent Challenge: Competitive and Long-Context Learning at Scale. (Track 2, S…☆79Mar 17, 2026Updated last week
- Official Repo for Open-Reasoner-Zero☆2,086Jun 2, 2025Updated 9 months ago
- (ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life☆367Dec 2, 2024Updated last year
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆407Dec 15, 2024Updated last year
- ☆32May 31, 2025Updated 9 months ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆25Dec 14, 2025Updated 3 months ago
- Test-Time Label-Shift Adaptation☆13May 24, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos☆479Mar 22, 2025Updated last year
- Minimal reproduction of DeepSeek R1-Zero☆12,963Feb 27, 2026Updated 3 weeks ago
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,713Updated this week
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆40Jul 13, 2024Updated last year
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,368Mar 15, 2026Updated last week
- s1: Simple test-time scaling☆6,646Jun 25, 2025Updated 9 months ago
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.☆6,903Dec 17, 2025Updated 3 months ago
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆383Mar 7, 2025Updated last year
- Paper List of Minecraft Agents☆58Mar 6, 2026Updated 2 weeks ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆179Sep 18, 2025Updated 6 months ago
- Official implementation of TBA for async LLM post-training.☆29Nov 5, 2025Updated 4 months ago
- Witness the aha moment of VLM with less than $3.☆4,041May 19, 2025Updated 10 months ago
- Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.☆839Feb 11, 2026Updated last month
- Scalable RL solution for advanced reasoning of language models☆1,821Mar 18, 2025Updated last year
- Recipes to train reward model for RLHF.☆1,521Apr 24, 2025Updated 11 months ago
- Minimalistic large language model 3D-parallelism training☆2,617Feb 19, 2026Updated last month