NKAI-Decision-Team / HEP-LLM-play-StarCraftII
Hierarchical Expert Prompt for Large-Language-Models: An Approach Defeat Elite AI in TextStarCraft-II for the First Time
☆34Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for HEP-LLM-play-StarCraftII
- TextStarCraft2, a pure-language environment that supports LLMs playing StarCraft II☆211Updated last month
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆86Updated 3 weeks ago
- Reinforcement learning and planning for Minecraft.☆158Updated 8 months ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆192Updated this week
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)☆151Updated 11 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆204Updated last month
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆32Updated 6 months ago
- An index of algorithms for reinforcement learning from human feedback (RLHF)☆87Updated 7 months ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆93Updated 2 months ago
- Align Anything: Training All-modality Model with Feedback☆248Updated last week
- Reference implementation for Token-level Direct Preference Optimization (TDPO)☆109Updated 4 months ago
- A paper collection tracking the continuing line of work that starts from World Models.☆140Updated 4 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆108Updated 7 months ago
- This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met…☆105Updated 2 months ago
- A collection of LLM with RL papers☆230Updated 6 months ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆47Updated last month
- ProAgent: Building Proactive Cooperative Agents with Large Language Models☆60Updated 7 months ago
- Code for Contrastive Preference Learning (CPL)☆154Updated 9 months ago
- AI Alignment: A Comprehensive Survey☆128Updated last year
- Building an open-ended embodied agent in a battle royale FPS game☆34Updated 9 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆321Updated last month
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen…☆255Updated last year
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆114Updated 2 months ago
- A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)☆138Updated last month
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆74Updated 9 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆199Updated 3 months ago