NKAI-Decision-Team / HEP-LLM-play-StarCraftIILinks
Hierarchical Expert Prompt for Large-Language-Models: An Approch Defeat Elite AI in TextStarCraft-II for the First Time
☆52Updated last year
Alternatives and similar repositories for HEP-LLM-play-StarCraftII
Users that are interested in HEP-LLM-play-StarCraftII are comparing it to the libraries listed below
Sorting:
- TextStarCraft2,a pure language env which support llms play starcraft2☆289Updated 6 months ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆140Updated 6 months ago
- This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met…☆158Updated last year
- ☆194Updated 3 months ago
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆43Updated 3 weeks ago
- Official implementation of the NeurIPS 2024 paper CORY☆22Updated 8 months ago
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆191Updated last year
- The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization☆123Updated last year
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆360Updated last month
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆379Updated last year
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆287Updated 11 months ago
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)☆194Updated last year
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆88Updated 6 months ago
- ☆162Updated 9 months ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆366Updated 3 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆241Updated 6 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆91Updated 7 months ago
- Improving Math reasoning through Direct Preference Optimization with Verifiable Pairs☆16Updated 7 months ago
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆177Updated 3 weeks ago
- A collection of LLM with RL papers☆278Updated last year
- ☆174Updated 10 months ago
- ☆95Updated last year
- Reinforced Multi-LLM Agents training☆56Updated 5 months ago
- ☆23Updated last year
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆396Updated 10 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆38Updated last year
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆139Updated 2 weeks ago
- ☆307Updated 5 months ago
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation☆47Updated 3 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆326Updated last week