NKAI-Decision-Team / HEP-LLM-play-StarCraftII
Hierarchical Expert Prompt for Large-Language-Models: An Approch Defeat Elite AI in TextStarCraft-II for the First Time
☆32Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for HEP-LLM-play-StarCraftII
- TextStarCraft2,a pure language env which support llms play starcraft2☆209Updated 3 weeks ago
- Reinforcement learning and planning for Minecraft.☆156Updated 8 months ago
- This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Co…☆71Updated 4 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆201Updated last month
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆80Updated 2 weeks ago
- ☆86Updated 3 months ago
- ☆17Updated 3 months ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆47Updated last month
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆32Updated 6 months ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆190Updated 5 months ago
- ☆113Updated 4 months ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆92Updated 2 months ago
- Align Anything: Training All-modality Model with Feedback☆224Updated last week
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)☆151Updated 10 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆114Updated last week
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆87Updated 6 months ago
- ☆16Updated 7 months ago
- ☆40Updated 11 months ago
- Implementation of the MATRIX framework (ICML 2024)☆39Updated 6 months ago
- Playing Hollow Knight with reinforcement learning.☆62Updated last year
- ☆52Updated this week
- [CVPR2024] This is the official implement of MP5☆83Updated 4 months ago
- AI-driven Yu-Gi-Oh! bot using deep reinforcement learning and LLMs☆71Updated 2 months ago
- AI Alignment: A Comprehensive Survey☆128Updated last year
- The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization☆93Updated 2 months ago
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen…☆254Updated last year
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆301Updated 3 weeks ago
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆132Updated last month
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆104Updated 4 months ago