tencent-ailab / hokoff
☆56Updated 11 months ago
Alternatives and similar repositories for hokoff
Users who are interested in hokoff are comparing it to the libraries listed below:
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆40Updated 5 months ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023), a football game agent☆63Updated 2 years ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆42Updated last year
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆143Updated 8 months ago
- ☆49Updated 7 months ago
- TextStarCraft2, a pure-language environment that supports LLMs playing StarCraft II☆293Updated 8 months ago
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆272Updated 9 months ago
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆194Updated last year
- ☆33Updated 2 years ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆95Updated 2 years ago
- ☆99Updated 3 weeks ago
- NeurIPS 2024 DACER☆153Updated 2 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆56Updated 2 years ago
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆187Updated 2 months ago
- Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"☆28Updated last year
- This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Re…☆639Updated last year
- Official implementation of QVPO☆56Updated 2 weeks ago
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆52Updated last year
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆40Updated last year
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning☆35Updated 11 months ago
- Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).☆170Updated 10 months ago
- On-Policy Policy Gradient Algorithms in JAX☆42Updated last year
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆533Updated last month
- ☆88Updated 2 years ago
- ☆63Updated last year
- [NeurIPS 2024] Official Implementation of Meta-DT☆50Updated last year
- ☆118Updated 8 months ago
- An RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM☆79Updated last year
- A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆384Updated last year