tencent-ailab / hokoffLinks
☆55Updated 10 months ago
Alternatives and similar repositories for hokoff
Users that are interested in hokoff are comparing it to the libraries listed below
Sorting:
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆39Updated 4 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆42Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆63Updated 2 years ago
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆95Updated 2 years ago
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆194Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆143Updated 7 months ago
- ☆49Updated 7 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆56Updated last year
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆274Updated 9 months ago
- NeurIPS 2024 DACER☆152Updated 2 months ago
- ☆33Updated 2 years ago
- [NeurIPS 2024] Official Implementation of Meta-DT☆50Updated last year
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning☆35Updated 11 months ago
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆186Updated 2 months ago
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆40Updated last year
- ☆98Updated 2 weeks ago
- ☆46Updated 2 years ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆401Updated last year
- ☆63Updated last year
- Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"☆28Updated last year
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆530Updated last month
- TextStarCraft2,a pure language env which support llms play starcraft2☆293Updated 7 months ago
- Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication☆543Updated 2 weeks ago
- ☆118Updated 8 months ago
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆52Updated last year
- ☆55Updated 6 months ago
- Online Preference Alignment for Language Models via Count-based Exploration☆17Updated 11 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆165Updated 2 years ago
- ☆411Updated last year