tencent-ailab / hokoff
☆54Updated 8 months ago
Alternatives and similar repositories for hokoff
Users who are interested in hokoff are comparing it to the libraries listed below.
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆35Updated 2 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆41Updated 10 months ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆192Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023), a football game agent☆62Updated 2 years ago
- ☆49Updated 4 months ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆138Updated 5 months ago
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆262Updated 6 months ago
- NeurIPS 2024 DACER☆139Updated last month
- TextStarCraft2, a pure-language environment that lets LLMs play StarCraft II☆288Updated 5 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆390Updated 9 months ago
- [NeurIPS 2024] Official Implementation of Meta-DT☆45Updated 11 months ago
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆94Updated 2 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆57Updated last year
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning☆34Updated 8 months ago
- ☆31Updated 2 years ago
- ☆62Updated 10 months ago
- Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).☆155Updated 7 months ago
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆165Updated 3 months ago
- ☆112Updated 5 months ago
- ☆84Updated 2 years ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆488Updated last year
- A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆377Updated last year
- This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Re…☆609Updated 9 months ago
- ☆53Updated 3 months ago
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆49Updated last year
- Online Preference Alignment for Language Models via Count-based Exploration☆16Updated 8 months ago
- Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"☆26Updated 9 months ago
- ☆87Updated 2 months ago
- [ICLR 2024] Official Implementation of ACORM☆59Updated last year