luchang1113 / HEP-LLM-play-StarCraftII
☆26Updated last month
Related projects: ⓘ
- TextStarCraft2,a pure language env which support llms play starcraft2☆192Updated last month
- AI-driven Yu-Gi-Oh! bot using deep reinforcement learning and LLMs☆63Updated last month
- Playing Hollow Knight with reinforcement learning.☆60Updated last year
- Reinforcement learning and planning for Minecraft.☆151Updated 6 months ago
- ☆16Updated last month
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆62Updated 2 months ago
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)☆141Updated 9 months ago
- source code for AAMAS 2023 Imperfect-information Card Game Competition☆12Updated 6 months ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆37Updated 2 weeks ago
- ☆16Updated 5 months ago
- HAZARD challenge☆25Updated 4 months ago
- This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met…☆89Updated 2 weeks ago
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆81Updated 5 months ago
- A collection of LLM with RL papers☆213Updated 4 months ago
- Align Anything: Training All-modality Model with Feedback☆100Updated last week
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen…☆248Updated last year
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆85Updated last week
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆89Updated 2 months ago
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆22Updated 5 months ago
- [ACL'2024] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆44Updated last month
- AI Alignment: A Comprehensive Survey☆123Updated 10 months ago
- ☆11Updated 11 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆31Updated 4 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆176Updated last week
- ☆52Updated 8 months ago
- ☆102Updated 2 months ago
- Baseline for NeurIPS_Auto_Bidding_General_Track☆20Updated last month
- Mini HoK: a novel MARL benchmark based on the popular mobile game, Honor of Kings, to address limitations in existing environments such a…☆29Updated 3 weeks ago
- ☆139Updated 2 months ago