HumanCompatibleAI / human_ai_robustness
☆21Updated 4 years ago
Related projects: ⓘ
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Updated 2 years ago
- A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI.☆55Updated 2 years ago
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles.☆26Updated 4 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆47Updated last year
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Updated 3 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆32Updated 3 years ago
- ☆27Updated last year
- ☆35Updated 2 years ago
- ☆44Updated last year
- An RL-Friendly Vision-Language Model for Minecraft☆24Updated last month
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 4 years ago
- Code for Automatic Curriculum Learning through Value Disagreement☆29Updated 4 years ago
- Reinforcement Learning papers on exploration methods.☆20Updated 3 years ago
- Reward Learning by Simulating the Past☆43Updated 5 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆25Updated 4 years ago
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆40Updated last year
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight)☆36Updated 3 years ago
- Web application where humans can play Overcooked with AI agents.☆55Updated last year
- Implements the Messenger environment and EMMA model.☆22Updated last year
- MoDem Accelerating Visual Model-Based Reinforcement Learning with Demonstrations☆82Updated last year
- ☆18Updated 5 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆30Updated 4 years ago
- ☆18Updated 3 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 3 years ago
- A paper list of sample-efficient reinforcement learning☆12Updated 2 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Updated 2 months ago
- My Body Is A Cage☆37Updated 3 years ago
- Implementation of Random Expert Distillation☆29Updated 5 years ago
- Generalised UDRL☆37Updated 2 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)☆62Updated 3 years ago