tencent-ailab / hokoffLinks

☆52

Alternatives and similar repositories for hokoff

Users that are interested in hokoff are comparing it to the libraries listed below

Sorting:

devindeng94 / smac-hard
Enabling Mixed Opponent Strategy Script and Self-play on SMAC
☆33Updated 2 months ago
tencent-ailab / marl-mini
☆48Updated 2 months ago
pickxiguapi / Uni-RLHF-Platform
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…
☆36Updated 8 months ago
NKAI-Decision-Team / LLM-PySC2
LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…
☆135Updated 2 months ago
OpenRL-Lab / TiZero
Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体
☆59Updated last year
PKU-RL / Plan4MC
[NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks
☆187Updated last year
mohmdelsayed / streaming-drl
Deep reinforcement learning without experience replay, target networks, or batch updates.
☆256Updated 4 months ago
qiwang067 / LS-Imagine
[ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"
☆143Updated last month
histmeisah / Large-Language-Models-play-StarCraftII
TextStarCraft2,a pure language env which support llms play starcraft2
☆282Updated 2 months ago
elicassion / StARformer
[ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.
☆95Updated 2 years ago
pickxiguapi / Clean-Offline-RLHF
Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …
☆38Updated last year
tyq1024 / RLx2
☆30Updated 2 years ago
happy-yan / DACER-Diffusion-with-Online-RL
NeurIPS 2024 DACER
☆127Updated last week
TobiasLv / RAD
☆52Updated last month
tinnerhrhe / MTDiff
☆62Updated 8 months ago
yuqingd / ellm
☆79Updated last year
MyRepositories-hub / Simple-Policy-Optimization
☆66Updated last week
wadx2019 / qvpo
official implementation of QVPO
☆44Updated 9 months ago
CJReinforce / JOWA
Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"
☆25Updated 7 months ago
maohangyu / TIT_open_source
The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"
☆57Updated last year
PKU-RL / AdaRefiner
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
☆16Updated 11 months ago
NVlabs / DRAIL
[NeurIPS'24] The Official PyTorch implementation of DRAIL
☆42Updated 7 months ago
charleshsc / QT
ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning
☆31Updated 6 months ago
cccedric / cpql
This is the official PyTorch implementation of the paper "Boosting Continuous Control with Consistency Policy".
☆42Updated last year
PKU-RL / Creative-Agents
☆44Updated last year
liuqh16 / MAZero
Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.
☆31Updated last year
DigiRL-agent / digiq
☆107Updated 3 months ago
WindyLab / LLM-RL-Papers
Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
☆449Updated 10 months ago
csmile-1006 / PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
☆163Updated last year
weipu-zhang / STORM
☆93Updated last year