51616 / marl-lipoLinks
Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)
☆19Updated last year
Alternatives and similar repositories for marl-lipo
Users that are interested in marl-lipo are comparing it to the libraries listed below
Sorting:
- ☆42Updated 2 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆100Updated 3 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆26Updated 2 years ago
- Simple maze environments using mujoco-py☆57Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆86Updated 8 months ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Updated 4 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆115Updated 3 years ago
- ☆47Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 2 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆104Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆75Updated 5 months ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆63Updated last year
- ☆17Updated last year
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆19Updated last year
- Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"☆20Updated 2 years ago
- Conservative Q learning in Jax☆54Updated 2 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆28Updated 2 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆32Updated 8 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆78Updated last year
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆50Updated 3 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆81Updated 2 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Updated last year
- Skeleton for scalable and flexible Jax RL implementations☆84Updated 2 years ago
- ☆48Updated 2 years ago
- ☆49Updated 3 weeks ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 3 weeks ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆90Updated last year