yunshiuan / tomnet-project
This repo contains the ToMnet+ model for preference inference. Developed by Yun-Shiuan, Edwinn, Hsin-Yi, and Elaine.
☆10 Updated last year
Related projects:
- The Implementation of "Machine Theory of Mind", ICML 2018 ☆20 Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated 4 months ago
- ☆26 Updated 4 years ago
- ☆34 Updated last year
- Object Centric Atari games ☆43 Updated this week
- ☆53 Updated 2 years ago
- Overcooked-AI Experiment Psiturk Demo (for MTurk experiments) ☆12 Updated 3 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL. ☆64 Updated 10 months ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction. ☆17 Updated 3 years ago
- Code for Model-Free Opponent Shaping (ICML 2022) ☆16 Updated last year
- ☆28 Updated 2 years ago
- Code for the paper Language as a Cognitive Tool to Imagine Goals in Curiosity Driven Exploration ☆27 Updated 3 years ago
- Change-Based Exploration Transfer ☆35 Updated 2 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021 ☆83 Updated 7 months ago
- Reinforcement Learning with Latent Flow ☆42 Updated 3 years ago
- Deep Hierarchical Planning from Pixels ☆85 Updated last year
- ☆20 Updated last year
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238 ☆44 Updated 3 years ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning ☆46 Updated 3 years ago
- Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Marti… ☆35 Updated 2 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆13 Updated 2 months ago
- Behavioural cloning experiments with video games ☆30 Updated 4 years ago
- Evaluating long-term memory of reinforcement learning algorithms ☆129 Updated last year
- ☆52 Updated 8 months ago
- A reinforcement learning environment for the IGLU 2022 at NeurIPS ☆32 Updated last year
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline" ☆30 Updated last year
- ☆19 Updated 5 months ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC) ☆101 Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆83 Updated 3 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆100 Updated 2 years ago