microsoft / segar
Sandbox environment for generalizable agent research
☆23 Updated 2 years ago
Related projects
Alternatives and complementary repositories for segar
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 Updated 4 years ago
- ☆37 Updated 2 years ago
- Change-Based Exploration Transfer ☆36 Updated 2 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆42 Updated last year
- ☆36 Updated last year
- Proto-RL: Reinforcement Learning with Prototypical Representations ☆82 Updated 2 years ago
- Invariant Causal Prediction for Block MDPs ☆43 Updated 4 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆31 Updated 3 years ago
- ☆41 Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆83 Updated 3 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository ☆21 Updated 3 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning ☆12 Updated 2 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier" ☆21 Updated last year
- Simplistic PyTorch Implementation of Dreamer-RL ☆20 Updated 2 years ago
- Novelty MiniGrid (NovGrid) is an extension of the MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated 6 months ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021 ☆83 Updated 9 months ago
- My Body Is A Cage ☆38 Updated 3 years ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning ☆48 Updated 3 years ago
- Object Centric Atari games ☆48 Updated this week
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools ☆17 Updated last year
- JAX implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆49 Updated 2 years ago
- Generalised UDRL ☆37 Updated 2 years ago
- Reinforcement Learning via Supervised Learning ☆68 Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆79 Updated last year
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information. ☆36 Updated 3 years ago
- ☆30 Updated 3 months ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021) ☆52 Updated 3 years ago
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments) ☆18 Updated 3 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction. ☆19 Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment ☆28 Updated 2 years ago