jbr-ai-labs / mamba
This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".
☆46Updated last year
Related projects:
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆49Updated 8 months ago
- MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research.☆43Updated 6 months ago
- A categorised list of Multi-Agent Reinforcement Learning (MARL) papers☆46Updated last year
- [NeurIPS 2023] Implementation of Elastic Decision Transformer☆28Updated 11 months ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆16Updated 2 years ago
- Synthetic Experience Replay☆62Updated 3 months ago
- The Starcraft Multi-Agent challenge lite☆32Updated last week
- Challenging Memory-based Deep Reinforcement Learning Agents☆76Updated 3 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆78Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆96Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆108Updated last year
- Learning diverse options through the Laplacian representation.☆22Updated 8 months ago
- Prioritized Experience Replay implementation with proportional prioritization☆67Updated last year
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆24Updated 2 years ago
- Code for the NeurIPS 2023 paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆24Updated 3 months ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆65Updated this week
- Code for TransDreamer: Reinforcement Learning with Transformer World Models☆20Updated 11 months ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆11Updated 4 months ago
- ELIGN: Expectation Alignment as a Multi-agent Intrinsic Reward☆19Updated last year
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆51Updated last month
- Benchmarked implementations of Offline RL Algorithms.☆62Updated 4 months ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆42Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆71Updated 9 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆137Updated 3 months ago
- ☆21Updated 2 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆101Updated last year
- Repo for Implicit Diffusion Q-Learning☆85Updated 9 months ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆25Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning☆100Updated 2 years ago
- The AI Arena: A framework for distributed multi-agent reinforcement learning☆14Updated 2 years ago