tyq1024 / RLx2
☆29 Updated last year
Alternatives and similar repositories for RLx2:
Users who are interested in RLx2 are comparing it to the repositories listed below:
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning" ☆55 Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023); football game agent ☆51 Updated last year
- Code accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS… ☆72 Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning" ☆58 Updated last year
- Official implementation of "UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers", ICLR 2021 (spotli… ☆130 Updated 4 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022) ☆69 Updated last year
- Author's PyTorch implementation of the ICLR 2023 paper "Behavior Proximal Policy Optimization" (BPPO). ☆81 Updated last year
- Code for "On the Robustness of Safe Reinforcement Learning under Observational Perturbations" (ICLR 2023) ☆44 Updated 2 months ago
- Implementation of Multi-Game Decision Transformers in PyTorch ☆45 Updated 2 years ago
- Implementation of SAC and TD3 based on various RNN and Transformer architectures. ☆19 Updated 5 months ago
- The official implementation of ERL-Re2. ☆62 Updated 8 months ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1) ☆28 Updated 2 months ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆58 Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human … ☆35 Updated 11 months ago
- Distributed RL implementation using PyTorch and Ray (ApeX (Ape-X), A3C, Distributed-PPO (DPPO), Impala) ☆26 Updated 2 years ago
- [ECCV 2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations. ☆93 Updated last year
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model" ☆92 Updated 2 weeks ago
- ICML 2024: Q-value Regularized Transformer for Offline Reinforcement Learning ☆25 Updated 2 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024… ☆33 Updated 3 months ago
- Decision Transformer: A brand new Offline RL Pattern. ☆34 Updated 3 years ago