NKAI-Decision-Team / HEP-LLM-play-StarCraftIILinks
Hierarchical Expert Prompt for Large-Language-Models: An Approch Defeat Elite AI in TextStarCraft-II for the First Time
☆49Updated 9 months ago
Alternatives and similar repositories for HEP-LLM-play-StarCraftII
Users that are interested in HEP-LLM-play-StarCraftII are comparing it to the libraries listed below
Sorting:
- TextStarCraft2,a pure language env which support llms play starcraft2☆285Updated 3 months ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆138Updated 3 months ago
- This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met…☆150Updated 11 months ago
- Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"☆38Updated 5 months ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆281Updated 8 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆268Updated 3 weeks ago
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆189Updated last year
- Official implementation of the NeurIPS 2024 paper CORY☆18Updated 5 months ago
- ☆152Updated 7 months ago
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen…☆282Updated 2 years ago
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆38Updated 2 months ago
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆372Updated last year
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆185Updated 3 months ago
- The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization☆117Updated 11 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆34Updated last year
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)☆189Updated last year
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆85Updated 11 months ago
- A collection of LLM with RL papers☆276Updated last year
- ☆21Updated 9 months ago
- Reinforced Multi-LLM Agents training☆35Updated last month
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆144Updated 7 months ago
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆85Updated 3 months ago
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆41Updated last year
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆143Updated 5 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆80Updated 2 months ago
- ☆258Updated 2 months ago
- ☆109Updated 4 months ago
- ☆161Updated 2 weeks ago
- AAAI24(Oral) ProAgent: Building Proactive Cooperative Agents with Large Language Models☆89Updated 5 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆147Updated 9 months ago