luchang1113 / HEP-LLM-play-StarCraftII

☆26

Related projects:

histmeisah / Large-Language-Models-play-StarCraftII
TextStarCraft2, a pure-language environment that supports LLMs playing StarCraft II
☆192 Updated last month
sbl1996 / ygo-agent
AI-driven Yu-Gi-Oh! bot using deep reinforcement learning and LLMs
☆63 Updated last month
seermer / HollowKnight_RL
Playing Hollow Knight with reinforcement learning.
☆60 Updated last year
PKU-RL / Plan4MC
Reinforcement learning and planning for Minecraft.
☆151 Updated 6 months ago
xihuai18 / arxiv-sanity-x
☆16 Updated last month
ZO-Bench / ZO-LLM
[ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark".
☆62 Updated 2 months ago
liziniu / ReMax
Code for the paper "ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models"
☆141 Updated 9 months ago
jidiai / Competition_AAMAS2023
Source code for the AAMAS 2023 Imperfect-information Card Game Competition
☆12 Updated 6 months ago
stevenyangyj / Emma-Alfworld
Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
☆37 Updated 2 weeks ago
LR32768 / DL_theory_exp
☆16 Updated 5 months ago
UMass-Foundation-Model / HAZARD
HAZARD challenge
☆25 Updated 4 months ago
BAAI-Agents / GPA-LM
This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met…
☆89 Updated 2 weeks ago
louieworth / awesome-rlhf
An index of algorithms for reinforcement learning from human feedback (RLHF)
☆81 Updated 5 months ago
floodsung / LLM-with-RL-papers
A collection of LLM with RL papers
☆213 Updated 4 months ago
PKU-Alignment / align-anything
Align Anything: Training All-modality Model with Feedback
☆100 Updated last week
CraftJarvis / MC-Planner
Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen…
☆248 Updated last year
bigai-ai / civrealm
CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.
☆85 Updated last week
Vance0124 / Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization (TDPO)
☆89 Updated 2 months ago
ZJLAB-AMMI / LLM4Teach
Python code implementing LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with large language models
☆22 Updated 5 months ago
ZHZisZZ / modpo
[ACL'2024] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
☆44 Updated last month
PKU-Alignment / AlignmentSurvey
AI Alignment: A Comprehensive Survey
☆123 Updated 10 months ago
Guangxuan-Xiao / GSM8K-eval
☆11 Updated 11 months ago
OpenDFM / Rememberer
[NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents
☆31 Updated 4 months ago
RL4VLM / RL4VLM
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
☆176 Updated last week
shiqiangw / iclr2024-scores
☆52 Updated 8 months ago
szxiangjn / world-model-for-language-model
☆102 Updated 2 months ago
alimama-tech / NeurIPS_Auto_Bidding_General_Track_Baseline
Baseline for NeurIPS_Auto_Bidding_General_Track
☆20 Updated last month
tencent-ailab / mini-hok
Mini HoK: a novel MARL benchmark based on the popular mobile game, Honor of Kings, to address limitations in existing environments such a…
☆29 Updated 3 weeks ago
cmnfriend / O-LoRA
☆139 Updated 2 months ago