ndrwmlnk / critic-guided-segmentation-of-rewarding-objects-in-first-person-views
Critic Guided Segmentation of Rewarding Objects in First-Person Views. Explanatory video:
☆12Updated 2 years ago
Alternatives and similar repositories for critic-guided-segmentation-of-rewarding-objects-in-first-person-views:
Users that are interested in critic-guided-segmentation-of-rewarding-objects-in-first-person-views are comparing it to the libraries listed below
- ForgER algorithm☆22Updated 2 years ago
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022)☆14Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- ☆42Updated last year
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Updated last year
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆79Updated last year
- My Body Is A Cage☆39Updated 3 years ago
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments)☆18Updated 3 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30Updated 2 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆23Updated last year
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆22Updated last year
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)☆13Updated 2 years ago
- Collection of reinforcement learning algorithms☆15Updated 3 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆17Updated last year
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Updated 6 years ago
- ☆13Updated last year
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)☆32Updated 2 years ago
- Revisiting Rainbow☆74Updated 3 years ago
- ☆42Updated 4 years ago
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning.☆23Updated last year
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆49Updated 2 years ago
- Behavioural cloning solution to MineRL2020 competition☆16Updated 3 years ago
- Conservative Q learning in Jax☆52Updated 2 years ago
- Invariant Causal Prediction for Block MDPs☆44Updated 4 years ago
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022)☆34Updated last year
- Sandbox environment for generalizable agent research☆24Updated 2 years ago
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆43Updated 11 months ago
- Change-Based Exploration Transfer☆36Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- Official release of CompoSuite, a compositional RL benchmark☆47Updated last year