51616 / marl-lipo
Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)
☆14Updated 4 months ago
Related projects: ⓘ
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆19Updated 5 months ago
- Implementation (TensorFlow/keras) of the DreamerV3 model-based RL algorithm by Hafner et al. 2023☆3Updated last year
- ☆12Updated 6 months ago
- EARL: Environment for Autonomous Reinforcement Learning☆33Updated last year
- ☆37Updated last year
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆22Updated last year
- ☆23Updated 2 years ago
- ☆15Updated 4 months ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆31Updated 6 months ago
- ☆20Updated 11 months ago
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022)☆32Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆24Updated last year
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆16Updated 8 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆71Updated 9 months ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆26Updated last month
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" ICLR 2022☆13Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆21Updated last year
- This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni…☆12Updated last year
- ☆14Updated 2 years ago
- Bipedal Skills Benchmark for Reinforcement Learning☆23Updated last year
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆24Updated last year
- Docker containers of baseline agents for the Crafter environment☆27Updated 2 years ago
- Source code to reproduce experiments from Mendez et al., ICLR '22☆20Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆59Updated 2 months ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆14Updated 5 months ago
- ☆21Updated 2 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆29Updated last year
- ☆16Updated last year
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆25Updated last year