NKAI-Decision-Team / HEP-LLM-play-StarCraftII
Hierarchical Expert Prompt for Large-Language-Models: An Approch Defeat Elite AI in TextStarCraft-II for the First Time
☆44Updated 6 months ago
Alternatives and similar repositories for HEP-LLM-play-StarCraftII:
Users that are interested in HEP-LLM-play-StarCraftII are comparing it to the libraries listed below
- TextStarCraft2,a pure language env which support llms play starcraft2☆261Updated 4 months ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆120Updated this week
- Reinforcement learning and planning for Minecraft.☆177Updated last year
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆96Updated last month
- This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met…☆140Updated 7 months ago
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆37Updated 3 weeks ago
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)☆181Updated last year
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆71Updated 2 weeks ago
- The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization☆109Updated 7 months ago
- Mini HoK: a novel MARL benchmark based on the popular mobile game, Honor of Kings, to address limitations in existing environments such a…☆38Updated last month
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆110Updated 7 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆35Updated 11 months ago
- ☆101Updated 2 weeks ago
- ☆132Updated 4 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆36Updated 5 months ago
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆367Updated last year
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆341Updated 4 months ago
- [NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simu…☆87Updated 3 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆173Updated last month
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆34Updated last year
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆265Updated 5 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆72Updated last month
- This project provides a set of translators to convert OpenAI Gym environments into text-based environments. It is designed to investigate…☆16Updated 10 months ago
- ☆51Updated 7 months ago
- [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"☆158Updated 4 months ago
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"☆255Updated 3 weeks ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆98Updated last year
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆136Updated last year
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen…☆274Updated last year
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆107Updated this week