phy-q / benchmark
Phy-Q: A Testbed for Physical Reasoning
☆44 Updated 9 months ago
Alternatives and similar repositories for benchmark:
Users who are interested in benchmark compare it to the repositories listed below.
- Model-Based Visual Planning with Self-Supervised Functional Distances (ICLR 2021) ☆20 Updated 3 years ago
- Implements the Messenger environment and EMMA model. ☆23 Updated last year
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction. ☆21 Updated 4 years ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning ☆48 Updated 3 years ago
- Change-Based Exploration Transfer ☆36 Updated 3 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors" ☆44 Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning ☆37 Updated 2 years ago
- ☆42 Updated 4 years ago
- ☆44 Updated last year
- Sandbox environment for generalizable agent research ☆25 Updated 2 years ago
- Generalised UDRL ☆37 Updated 2 years ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight) ☆37 Updated 4 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021 ☆85 Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 Updated 5 years ago
- ☆20 Updated 2 years ago
- Reinforcement Learning via Supervised Learning ☆71 Updated 2 years ago
- ☆42 Updated 2 years ago
- PyTorch Package For Quasimetric Learning ☆42 Updated 6 months ago
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022) ☆14 Updated 3 years ago
- ☆15 Updated 3 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning ☆14 Updated 2 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository ☆22 Updated 4 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information. ☆36 Updated 4 years ago
- Docker containers of baseline agents for the Crafter environment ☆28 Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆31 Updated 3 years ago
- ☆54 Updated 3 years ago