tencent-ailab / hokoffLinks
☆52Updated 5 months ago
Alternatives and similar repositories for hokoff
Users that are interested in hokoff are comparing it to the libraries listed below
Sorting:
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆33Updated 2 months ago
- ☆48Updated 2 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆36Updated 8 months ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆135Updated 2 months ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆59Updated last year
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆187Updated last year
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆256Updated 4 months ago
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆143Updated last month
- TextStarCraft2,a pure language env which support llms play starcraft2☆282Updated 2 months ago
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆95Updated 2 years ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆38Updated last year
- ☆30Updated 2 years ago
- NeurIPS 2024 DACER☆127Updated last week
- ☆52Updated last month
- ☆62Updated 8 months ago
- ☆79Updated last year
- ☆66Updated last week
- official implementation of QVPO☆44Updated 9 months ago
- Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"☆25Updated 7 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆57Updated last year
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆16Updated 11 months ago
- [NeurIPS'24] The Official PyTorch implementation of DRAIL☆42Updated 7 months ago
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning☆31Updated 6 months ago
- This is the official PyTorch implementation of the paper "Boosting Continuous Control with Consistency Policy".☆42Updated last year
- ☆44Updated last year
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆31Updated last year
- ☆107Updated 3 months ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆449Updated 10 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆163Updated last year
- ☆93Updated last year