camel-ai / VLM-Play-StarCraft2
☆28 Updated 6 months ago
Alternatives and similar repositories for VLM-Play-StarCraft2
Users who are interested in VLM-Play-StarCraft2 are comparing it to the libraries listed below.
- Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents ☆44 Updated 4 months ago
- Natural Language Reinforcement Learning ☆97 Updated last month
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World". ☆29 Updated last month
- Bayes-Adaptive RL for LLM Reasoning ☆39 Updated 3 months ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control ☆61 Updated 8 months ago
- An RL-Friendly Vision-Language Model for Minecraft ☆36 Updated 11 months ago
- implementation of dualformer ☆20 Updated 6 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024… ☆41 Updated 10 months ago
- ☆34 Updated 6 months ago
- ☆48 Updated 4 months ago
- ☆63 Updated 6 months ago
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and… ☆66 Updated 5 months ago
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934 ☆88 Updated last week
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models ☆19 Updated 3 months ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human … ☆41 Updated last year
- [NeurIPS 2025 Spotlight] ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning ☆117 Updated last week
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024) ☆37 Updated last year
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs ☆13 Updated 2 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers ☆20 Updated 6 months ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation ☆23 Updated last month
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025) ☆29 Updated 9 months ago
- On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability ☆40 Updated 2 months ago
- ☆62 Updated this week
- DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning ☆155 Updated last month
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula… ☆95 Updated 3 months ago
- ☆67 Updated last year
- ☆112 Updated 5 months ago
- A testbed for agents and environments that can automatically improve models through data generation. ☆27 Updated 6 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆26 Updated last year
- ☆44 Updated last year