znowu / mirror-learning
The code for experiments conducted to verify the correctness of mirror learning.
☆10Updated 2 years ago
Related projects
Alternatives and complementary repositories for mirror-learning
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆36Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆15Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Episodic Control☆19Updated 2 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆16Updated 3 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆12Updated 2 years ago
- Image-based gridworld experiment for learning Markov state abstractions☆19Updated 2 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- My Body Is A Cage☆38Updated 3 years ago
- Simplistic PyTorch implementation of Dreamer-RL☆20Updated 2 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Updated 2 years ago
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆17Updated last year
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models☆29Updated 3 years ago
- PyTorch implementation of DreamerV2: Mastering Atari with Discrete World Models☆50Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆54Updated 4 months ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023); a football game agent☆13Updated last year
- Generalised UDRL☆37Updated 2 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆52Updated 3 years ago
- Mirror Descent Policy Optimization☆38Updated 4 years ago
- Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".☆14Updated 3 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆23Updated 5 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago