camel-ai / VLM-Play-StarCraft2Links
☆31Updated 3 weeks ago
Alternatives and similar repositories for VLM-Play-StarCraft2
Users that are interested in VLM-Play-StarCraft2 are comparing it to the libraries listed below
Sorting:
- Bayes-Adaptive RL for LLM Reasoning☆44Updated 8 months ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆39Updated last year
- Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents☆55Updated last month
- Natural Language Reinforcement Learning☆101Updated 6 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆26Updated 11 months ago
- [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆41Updated 6 months ago
- implementation of dualformer☆24Updated 11 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Updated 5 months ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆66Updated 3 weeks ago
- ☆70Updated last year
- ☆51Updated 8 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆150Updated 4 months ago
- Reinforcement Learning via Regressing Relative Rewards☆38Updated last year
- DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning☆166Updated 2 months ago
- ☆67Updated 10 months ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆28Updated 5 months ago
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆70Updated 10 months ago
- Official repository of S-Agents: Self-organizing Agents in Open-ended Environment☆26Updated last year
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs☆18Updated 6 months ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆46Updated 2 years ago
- ☆31Updated last year
- ☆75Updated 2 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆28Updated 10 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated last year
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Updated last year
- Verlog: A Multi-turn RL framework for LLM agents☆67Updated 2 weeks ago
- ☆35Updated 10 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Updated last year
- ☆53Updated 11 months ago
- ☆132Updated 2 months ago