twni2016 / Memory-RLLinks
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆65Updated last year
Alternatives and similar repositories for Memory-RL
Users that are interested in Memory-RL are comparing it to the libraries listed below
Sorting:
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 3 years ago
- Benchmarked implementations of Offline RL Algorithms.☆75Updated 6 months ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆37Updated 6 months ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆44Updated 2 years ago
- Synthetic Experience Replay☆102Updated last year
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆88Updated 9 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆100Updated 3 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆32Updated 9 months ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆115Updated 3 years ago
- ☆57Updated 2 years ago
- Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023☆28Updated last year
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated last month
- Implementation of SAC and TD3 based on various RNN and Transformer.☆22Updated 11 months ago
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)☆21Updated 4 years ago
- Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022☆63Updated 2 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆24Updated 4 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Updated 4 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated last month
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆80Updated 2 years ago
- Model-Based Offline Reinforcement Learning☆51Updated 4 years ago
- ☆43Updated 2 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆42Updated 3 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆18Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆88Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL☆148Updated 2 years ago
- Simple maze environments using mujoco-py☆57Updated last year
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL☆23Updated 10 months ago