ndrwmlnk / critic-guided-segmentation-of-rewarding-objects-in-first-person-viewsLinks
Critic Guided Segmentation of Rewarding Objects in First-Person Views. Explanatory video:
☆13Updated 3 years ago
Alternatives and similar repositories for critic-guided-segmentation-of-rewarding-objects-in-first-person-views
Users that are interested in critic-guided-segmentation-of-rewarding-objects-in-first-person-views are comparing it to the libraries listed below
Sorting:
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022)☆14Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- ForgER algorithm☆22Updated 2 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆24Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆80Updated 2 years ago
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments)☆18Updated 4 years ago
- ☆43Updated 4 years ago
- ☆45Updated 2 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆81Updated 2 years ago
- Revisiting Rainbow☆75Updated 3 years ago
- ☆13Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 2 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆28Updated 5 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- My Body Is A Cage☆41Updated 4 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆50Updated 3 years ago
- Change-Based Exploration Transfer☆36Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆86Updated 3 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated last year
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆85Updated last year
- Learning from Trajectories via Subgoal Discovery☆12Updated 4 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)☆77Updated last year
- On the model-based stochastic value gradient for continuous reinforcement learning☆55Updated last year
- Implementation of Data Efficient Reinforcement Learning in Pytorch☆20Updated 5 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Updated 4 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models☆30Updated 4 years ago
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)☆32Updated 2 years ago