ndrwmlnk / critic-guided-segmentation-of-rewarding-objects-in-first-person-views
Critic Guided Segmentation of Rewarding Objects in First-Person Views. Explanatory video:
☆13 · Updated 2 years ago
Alternatives and similar repositories for critic-guided-segmentation-of-rewarding-objects-in-first-person-views:
Users who are interested in critic-guided-segmentation-of-rewarding-objects-in-first-person-views are comparing it to the repositories listed below.
- ForgER algorithm ☆22 · Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment ☆28 · Updated 3 years ago
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022) ☆14 · Updated 3 years ago
- ☆42 · Updated 4 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆80 · Updated 2 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆31 · Updated 3 years ago
- ☆44 · Updated last year
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆80 · Updated 2 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier" ☆22 · Updated last year
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning. ☆24 · Updated last year
- Source code for the paper "Policy Architectures for Compositional Generalization in Control" ☆30 · Updated 2 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019 ☆10 · Updated 2 years ago
- Efficient Exploration via State Marginal Matching (2019) ☆67 · Updated 5 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning ☆16 · Updated last year
- EARL: Environment for Autonomous Reinforcement Learning ☆37 · Updated 2 years ago
- Revisiting Rainbow ☆74 · Updated 3 years ago
- Reinforcement Learning via Supervised Learning ☆71 · Updated 2 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools ☆18 · Updated 2 years ago
- ☆26 · Updated 2 years ago
- Sandbox environment for generalizable agent research ☆24 · Updated 2 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning ☆55 · Updated last year
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines ☆49 · Updated 2 years ago
- My Body Is A Cage ☆39 · Updated 3 years ago
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023) ☆13 · Updated 2 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills" ☆37 · Updated 5 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238 ☆44 · Updated 4 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 · Updated 10 months ago
- This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni… ☆14 · Updated 2 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models ☆30 · Updated 3 years ago
- Learning from Trajectories via Subgoal Discovery ☆12 · Updated 4 years ago