HumanCompatibleAI / human_ai_robustness
☆21 · Updated 4 years ago
Alternatives and similar repositories for human_ai_robustness:
Users who are interested in human_ai_robustness are comparing it to the libraries listed below.
- This is the source code of RPG (Reward-Randomized Policy Gradient) ☆43 · Updated 2 years ago
- A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI. ☆56 · Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023), a football game agent ☆51 · Updated last year
- Minimal code for A Generalist Agent ☆38 · Updated 2 years ago
- ☆33 · Updated last year
- A new paper list for multi-agent reinforcement learning (actively maintained) ☆25 · Updated 4 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021 ☆16 · Updated 3 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch ☆17 · Updated 2 years ago
- [ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning ☆31 · Updated last year
- Random Network Distillation(RND) algo in Pytorch ☆48 · Updated 5 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization" ☆37 · Updated 5 years ago
- ☆40 · Updated 3 years ago
- MultiTask Environments for Reinforcement Learning. ☆74 · Updated 2 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information. ☆36 · Updated 4 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 · Updated 5 years ago
- Code for Sibling Rivalry and experiments presented in associated paper ☆17 · Updated 3 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS… ☆72 · Updated 2 years ago
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning" ☆18 · Updated 2 years ago
- A2C is a special case of PPO! ☆19 · Updated 2 years ago
- Reinforcement Learning papers on exploration methods. ☆20 · Updated 3 years ago
- Web application where humans can play Overcooked with AI agents. ☆57 · Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms. ☆69 · Updated 2 weeks ago
- My Body Is A Cage ☆39 · Updated 3 years ago
- Learning to Coordinate Manipulation Skills via Skill Behavior Diversification (ICLR 2020) ☆44 · Updated 2 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020) ☆39 · Updated 4 years ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight) ☆37 · Updated 3 years ago
- ☆42 · Updated 4 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation) ☆65 · Updated 3 years ago
- Implementation of the Off Belief Learning algorithm. ☆45 · Updated 2 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020. ☆101 · Updated last month